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process in which such semiconductor products or services might 
be or are used. 
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INTRODUCTION 



In this manual, Texas Instruments presents technical information on the Tl 
SN74ACT8800 family of 32-bit processor "building block" circuits. The 
SN74ACT8800 family is composed of single-chip VLSI processor functions, all of which 
are designed for high-complexity processing applications. 

This manual includes specifications and operational information on the following high- 
performance advanced-CMOS devices: 

• SN74ACT8818 16-bit microsequencer 

• SN74ACT8832 32-bit registered ALU 

• SN74ACT8836 32- x 32-bit parallel multiplier 

• SN74ACT8837 64-bit floating point processor 

• SN74ACT8841 Digital crossbar switch 

• SN74ACT8847 64-bit floating point/integer processor 

These high-speed devices operate at or above 20 MHz, while providing the low power 
consumption of Tl's advanced one-micron EPIC™ CMOS technology. The EPIC™ CMOS 
process combines twin- well structures for increased density with one-micron gate 
lengths for increased speed. 

The SN74ACT8800 Family Data Manual contains design and specification data for 
all five devices previously listed and includes additional programming and operational 
information for the '8818, '8832, and '8837/'8847. 

Introductory sections of the manual include an overview of the '8800 family and a 
summary of the software tools and design support Tl offers for the chip-set. The general 
information section includes an explanation of the function tables, parameter 
measurement information, and typical characteristics related to the products listed 
in this volume. 

Package dimensions are given in the Mechanical Data section of the book in metric 
measurement (and parenthetically in inches). 

Complete technical data for any Texas Instruments semicondutor product is available 
from your nearest Tl field sales office, local authorized Tl distributor, or by calling Texas 
Instruments at 1-800-232-3200. 



EPIC is a trademark of Texas Instruments Incorporated. 
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Award Winners/1987 




Electronic Products' 

12th Annual 
Product of the Year 
Awards 

Every year, the editors of Electronic Products 
select what we judge to be the best electron- 
ics products announced during the preceding 
12 months. Our goal: to honor those products that 
have contributed most to help our engineer-readers 
do their jobs better. 

We weigh several criteria as we sift through the 
thousands of products brought to our attention. 
Alone or in combination, each product must repre- 
sent a significant advance in technology or its appli- 
cation, a decided innovation in design, or a sub- 
stantial gain in price-performance ratio. 




• 74ACT8836 Multiplier-Accumulator 

• 74ACT8837 Floating Point Unit 

• 74AS8840 Crossbar Switch 



vii 



viii 



Overview 



SN74ACT8818 16-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 
SN74ACT8836 32- x 32-Bit Parallel Multiplier 



SN74ACT8837 64-Bit Floating Point Processor 











SN74/ 




Digital Crossbar Sw 


itch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 



Support 



Mechanical Data 
_ 



1-1 




o 

< 

CD 

< 

5' 



1-2 



Overview 




1-4 



Introduction 



Texas Instruments SN74ACT8800 family of 32-bit processor building blocks has been 
developed to allow the easy, custom design of functionally sophisticated, high- 
performance processor systems. The '8800 family is composed of single-chip, VLSI 
devices, each of which represents an element of a CPU. 

Geared for computationally intensive applications, SN74ACT8800 devices include high- 
performance ALUs, multipliers, microsequencers, and floating point processors. 

The '8800 chip set provides the performance, functionality, and flexibility to fill the 
most demanding processing needs and is structured to reduce system design cost 
and effort. Most of these high-speed processor functions operate at 20 MHz and above, 
and, at the same time, provide the power savings of Tl's advanced, 1 /xm EPIC™ CMOS 
technology. 

The family's building block approach allows the easy, "pick-and-choose" creation of 
customized processor systems, while the devices' high level of integration provides 
cost-effectiveness. 

Designed especially for high-complexity processing, the devices in the '8800 family 
offer a range of functional options. Device features include three-port architecture, 
double-precision accuracy, optional pipelined operation, and built-in fault tolerance. 

Array/digital signal, image, and graphics processing can be optimized with '8800 
devices. Other applications are found in supermini and fault-tolerant computers, and 
I/O and network controllers. 

In addition to the high-performance, CMOS processor functions featured in this data 
manual, the family includes several high-speed, low-power bipolar support chips. To 
reduce power dissipation and ensure reliabilty, these bipolar devices use Tl's proprietary 
Schottky Transistor Logic (STL) internal circuitry. 



EPIC is a trademark of Texas Instruments Incorporated. 
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At present, Tl's '8800 32-bit processor building block family comprises the following 
functions: 

SN74ACT881 8 16-bit microsequencer 
SN74ACT8832 32-bit registered ALU 
SN74ACT8836 32- x 32-bit parallel multiplier 
SN74ACT8837 64-bit floating point processor 
SN74ACT8841 digital crossbar switch 
SN74ACT8847 64-bit floating point and integer processor 
Bipolar Support Chips 

• SN74AS8833 64-bit funnel shifter 

• SN74AS8834 64 x 40 register file 

• SN74AS8838 32-bit barrel shifter 

• SN74AS8839 32-bit shuffle/exchange network 

• SN74AS8840 16 x 4 crossbar switch 

20 MIPS and Low CMOS Power Consumption 

With instruction cycle times of 50 ns or less and the low power consumption of EPIC™ 
CMOS, the '8800 chip set offers an unrivaled speed/power combination. Unlike 
traditional microprocessors, which require multiple cycles to perform an operation, 
the 'ACT8800 processors typically can complete instructions in a single cycle. 

The 'ACT8832 registered ALU and 'ACT881 8 microsequencer together create a 
powerful 20-MHz CPU. Because instructions can be performed in a single cycle, the 
8832/881 8 combination is capable of executing over 20 million instructions per second 
(MIPS). 

For math-intensive applications, the 'ACT8836 fixed-point multiplier/accumulator 
(MAC), 'ACT8837 64-bit floating point processor, and 'ACT8847 64-bit floating point 
and integer processor offer unprecedented computational power. 

The exceptional performance of the 'ACT8800 family is made possible by Tl's EPIC™ 
CMOS technology. The EPIC™ CMOS process combines twin-well structures for 
increased density with one-micron gate lengths for increased speed. 

Customized Solution 

The '8800 family is designed with a variety of architectural and functional options 
to provide maximum design flexibility. These device features allow the creation of 
''customized" solutions with the '8800 chipset. 

A building block approach to processing allows designers to match specialized hardware 
to their specific design needs. The 8818/8832 combination forms the basis of the 
system, a high-speed CPU. For applications requiring high-speed integer multiplication, 
the 'ACT8836 can be added. To provide the high precision and large dynamic range 
of floating point numbers, the 'ACT8837 or 'ACT8847 can be employed. 
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To ensure speed and flexibility, each component of the '8800 family has three data 
ports. Each data port accommodates 32 bits of data, plus four parity bits. This 
architecture eliminates many of the I/O bottlenecks associated with traditional single- 
1/0 microprocessors. 

The three-port architecture and functional partitioning of the '8800 chip-set opens 
the door to a variety of parallel processing applications. Placing the math and shifting 
functions in parallel with the ALU permits concurrent processing of data. Additional 
processors can be added when performance needs dictate. 

The 'ACT8800 building block processors are microprogrammabie, so that their 
instruction sets can be tailored to a specific application. This high degree of 
programmability offers greater speed and flexibility than a typical microprocessor and 
ensures the most efficient use of hardware. 

A separate control bus eliminates the need for multiplexing instructions and data, further 
reducing processing bottlenecks. The microcode bus width is determined by the 
designer and the application. 

Another source of design flexibility is provided by the pipelined/flowthrough operation 
option. Pipelining can dramatically reduce the time required to perform iterative, or 
sequential, calculations. On the other hand, random or nonsequential algorithms require 
fast flowthrough operations. The '8800 chip set allows the designer to select the mode 
(fully pipelined, partially pipelined, or nonpipelined) most suited to each design. 

Scientific Accuracy 

The '8800 family is designed to support applications which require double-precision 
accuracy. Many scientific applications, such as those in the areas of high-end graphics, 
digital signal processing, and array processing, require such accuracy to maintain data 
integrity. In general-purpose computing applications, floating point processors must 
often support double-precision data formats to maintain compatibility with existing 
software. 

To ensure data integrity, '8800 devices (excluding the barrel shifter and 
microsequencer) support parity checking and generation, as well as master/slave error 
detection. Byte parity checking is performed on the input ports, and a parity generator 
and a master/slave comparator are provided at the output. Fault tolerance is built into 
the processors, ensuring correct device operation without extra logic or costly software. 
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The SN74ACT8800 Building Block Processor System 

■ Some of the high-performance '8800 devices are described in the following paragraphs. 
SN74ACT881 8 16-Bit Microsequencer 

■ o 

< In a high-performance microcoded system, a fast microcode controller is required to 
5 control the flow of instructions. The SN74ACT8818 is a high-speed, versatile 16-bit 
£. microsequencer capable of addressing 64K words of microcode memory. The 
2 'ACT8818 can address the next instruction fast enough to support a 50-ns system 
cycle time. 

The 'ACT881 8 65-word-deep by 16-bit-wide stack is useful for storing subroutine 
return addresses, top of loop addresses, and loop counts. Addresses can be sourced 
from eight different sources: the three I/O ports, the two register counters, the 
microprogram counter, the stack, and the 16-way branch. 

SN74ACT8832 Registered ALU 

The SN74ACT8832 is a 32-bit registered ALU that operates at approximately 20 Mhz. 
Because instructions can be performed in a single cycle, the 'ACT8832 is capable of 
executing 20 million microinstructions per second. An on-board 64-word register file 
is 36-bits-wide to permit the storage of parity bits. The 3-operand register file increases 
performance by enabling the creation of an instruction and the storage of the previous 
result in a single cycle. To facilitate data transfer, operands stored in the register file 
can be accessed externally, while the ALU is executing. To support the parallel 
processing of data, the 'ACT8832 can be configured to operate as four 8-bit ALUs, 
two 1 6-bit ALUs, or a single 32-bit ALU. The 'ACT8832 incorporates 32-bit shifters 
for double-precision shift operations. 

SN74ACT8836 32- x 32-Bit Integer MAC 

The SN74ACT8836 is a 32-bit integer multiplier/accumulator (MAC) that accepts two 
32-bit inputs and computes a 64-bit product. The device can also operate as a 64-bit 
by 64-bit multiplier. An onboard adder is provided to add or subtract the product or 
the complement of the product from the accumulator. 

When pipelined internally, the 1 /im CMOS parallel MAC performs a full 32- x 32-bit 
multiply/accumulate in a single 36-ns clock cycle. In flowthrough mode (without any 
pipelining), the 'ACT8836 takes 60 ns to multiply two 32-bit numbers. The ' ACT8836 
performs a 64- x 64-bit multiply/accumulate, outputting a 64-bit result, in 225 ns. 

The * ACT8836 can handle a wide variety of data types, including two's complement, 
signed, and mixed. Division is supported via the Newton-Raphson algorithm. 

SIM74ACT8837 64-Bit Floating Point Unit 

The SN74ACT8837 is a high-speed floating point processor. This single-chip device 
performs 32- or 64-bit floating point operations. 
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More than just a coprocessor, the 'ACT8837 integrates on one chip a double-precision 
floating point ALU and multiplier. Integrating these functions on a single chip reduces 
data routing problems and processing overhead. In addition, three data ports and a 
64-bit internal bus architecture allow for single-cycle operations. 

The 'ACT8837 can be pipelined for iterative calculations or can operate with input 
registers disabled for low latency. 

SN74ACT8841 Digital Crossbar Switch 

The SN74ACT8841 is a single-chip digital crossbar switch. The high-performance 
device, cost-effectively eliminates bottlenecks to speed data through complex bus 
architecture. 

The 'ACT8841 is ideal for multiprocessor applications, where memory bottlenecks 
tend to occur. The device has 64 bidirectional I/O ports that can be configured as 1 6 
4-bit ports, 8 8-bit ports, or 4 16-bit ports. Each bidirectional port can be connected 
in any conceivable combination. Any single input port can be broadcast to any 
combination of output ports. The total time for data transfer is 20 ns. 

The control sources for ten separate switching configurations are on-chip, including 
eight banks of programmable control flip-flops and two hard-wired control circuits. 

The EPIC™ CMOS SN74ACT8841 and its predecessor, SN74AS8840, are based on 
the same architecture, differing in power consumption, number of control registers, 
and pin-out. Microcode written for the 'AS8840 can be run on the 'ACT8841 . 

SN74ACT8847 64-Bit Floating Point Unit 

The SN74ACT8847 is a high-speed 64-bit floating point processor. The device is fully 
compatible with IEEE standard 754-1985 for addition, subtraction, multiplication, 
division, square root, and comparison. Division and square root operations are 
implemented via hardwired control. 

The SN74ACT8847 FPU also performs integer arithmetic, logical operations, and logical 
shifts. Registers are provided at the inputs, outputs, and inside the ALU and multiplier 
to support multilevel pipelining. These registers can be bypassed for nonpipelined 
operations. 

When fully pipelined, the 'ACT8847 can perform a double-precision floating point or 
32-bit integer operation in under 40 ns. When in flowthrough mode, the 'ACT8847 
takes less than 100 ns to perform an operation. 

Bipolar Support Chips 

The SN74AS8833 64-bit-to-32-bit funnel shifter can increase overall speed in systems 
where multi-bit shift operations and field masking are frequently used. The device can 
perform logical, circular, and arithmetic shifts on 32-bit and 64-bit words, IEEE or IBM 
normalization, and field pack or extract operations. The 'AS8833 provides 
shift/mask/merge capability for graphics and data compression applications. 
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The SN74AS8834 is a high-speed, three-operand, 64-word by 40-bit register file. 

Designed to expand the 'ACT8832 register file, the 'AS8834 is an ideal temporary 
eI storage device for high-speed applications. Four address ports, two write and two 
mm read, operate independently to support MSH/LSH swap operations. 

O The SN74AS8838 high-speed, 32-bit barrel shifter can shift up to 32 bits in a single 
< instruction cycle of under 25 ns. Five basic shifts can be programmed: circular left, 
^ circular right, logical left, logical right, and arithmetic right. The ' AS8838 offloads the 
gjv responsibility for shifting operations from the ALU, which increases shifter functionality 
$ and system throughput. 

The SN74AS8839 is a 32-bit shuffle/exchange network. The high-speed device can 
perform data permutations on one 32-bit, two 16-bit, four 8-bit, or eight 4-bit data 
words in a single instruction cycle of under 25 ns. The shuffle/exchange network is 
designed primarily for use in digital signal processing applications. 
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SN74ACT8818 
16-Bit Microsequencer 

Addresses Up to 64K Locations of Microprogram Memory 

CLK-to-Y > 30 ns (t pc |) 

Low-Power EPIC ™ CMOS 

00 

Addresses Selected from Eight Different Sources CO 

Performs Multiway Branching, Conditional Subroutine Calls, and Nested O 
Loops ^ 

Large 64- Word by 16-bit Stack 2 



00 



• Cascadable 

Because they're microprogrammable, the ACT8800 building block processors provide 
greater speed and flexibility than does a typical microprocessor. In such a high- 
performance microcoded system, a fast microsequencer is required to control the flow 
of microinstructions. 

The SN74ACT8818 is a high-speed, versatile 16-bit microsequencer capable of 
addressing 64K words of microcode memory. The 'ACT8818 can address the next 
instruction fast enough to support a 50-ns system cycle time. 

The 'ACT8818 65-word-deep by 16-bit-wide stack is useful for storing subroutine 
return addresses, top-of-loop addresses, and loop counts. For added flexibility, 
addresses can be selected from eight different sources: the three I/O ports, the two 
register/counters, the microprogram counter, the stack, and the 1 6-way branch input. 



CO 



EPIC is a trademark of Texas Instruments Incorporated. 
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Introduction 



The SN74ACT8818 microsequencer is a low-power, high-performance microsequencer 
implemented in 7Ts EPIC™ Advanced CMOS technology. The 1 6-bit device addresses 
up to 64K locations of microprogram memory and is compatible with the SN74AS890 
microsequencer. 

The 'ACT8818 performs a range of sequencing operations in support of Tl's family oo 
of building block devices and special-purpose processors such as the SN74ACT8847 JJj 
Floating Point Unit (FPU). 

Understanding the 'ACT8818 Microsequencer U 

The 'ACT8818 microsequencer is designed to control execution of microcode in a J* 
microprogrammed system. Basic architecture of such a system usually incorporates 
at least the microsequencer, one or more processing elements such as the 'ACT8847 (/) 
FPU or the SN74ACT8832 Registered ALU, microprogram memory, microinstruction 
register, and status logic to monitor system states and provide status inputs to the 
microsequencer. 

The ' . ACT88 1 8 combines flexibility and high speed in a microsequencer that performs 
multiway branching, conditional subroutine calls, nested loops, and a variety of other 
microprogrammable operations. The 'ACT8818 can also be cascaded for providing 
additional register/counters or addressing capability for more complex microcoded 
control functions. 

In this microsequencer, several sources are available for microprogram address 
selection. The primary source is the 16-bit microprogram counter (MPC), although 
branch addresses may be input on the two 1 6-bit address buses, DRA and DRB. An 
address input on the DRA bus can be pushed on the stack for later selection. 
Register/counters RCA and RCB can store either branch addresses or loop counts as 
needed, either for branch operations or for looping on the stack. 

The selection of address source can be based on external status from the device being 
controlled, so that three-way or multiway branching is supported. Once selected, the 
address which is output on the Y bus passes to the microprogram memory, and the 
microinstruction from the selected location is clocked into the pipeline register at the 
beginning of the next cycle. 

It is also possible to interrupt the 'ACT8818 by placing the Y output bus in a high- 
impedance state and forcing an interrupt vector on the Y bus. External logic is required 
to place the bus in high impedance and load the interrupt vector. The first 



EPIC is a trademark of Texas Instruments Incorporated. 
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microinstruction of the interrupt handler subroutine can push the address from the 
Interrupt Return register on the stack so that proper linkage is preserved for the return 
from subroutine. 

Microprogramming the 'ACT88 18 

Microinstructions for the 'ACT8818 select the specific operations performed by the 
mm Y output multiplexer, the register/counters RCA and RCB, the stack, and the 
KJ bidirectional DRA and DRB buses. Each set of inputs is represented as a separate field 
*™ in the microinstructions, which control not only the microsequencer but also the ALU 
(/> or other devices in the system. 

%4 The 3-port architecture of the /ACT881 8 facilitates both branch addressing and 

^ register/counter operations. Both register/counters can be used to hold either loop 

q counts or branch addresses loaded from the DRA and DRB buses. Register/counter 

H operations are selected by control inputs RC2-RC0. 
00 

00 Similarly, the 65-word by 16-bit stack can save addresses from the DRA bus, the 
00 microprogram counter (MPC), or the Interrupt Return register, depending on the settings 
of stack controls S2-S0 and related control inputs. Flexible instructions such as Branch 
DRA else Branch to Stack else Continue can be coded to take advantage of the 
conditional branching capability of the 'ACT8818. 

Multiway branching ( 1 6- or 32-way) uses the B3-B0 inputs to set up a 1 6-way branch 
address on DRA or DRB by concatenating B3-B0 with the upper 12 bits of the DRA 
or DRB bus. The resulting branch addresses DRA' (DRA1 5-DRA4::B3-B0) and DRB' 
(DRB1 5-DRB4::B3-B0) are selected by the Y output multiplexer controls MUX2-MUX0. 
A Branch DRB' else Branch DRA' instruction can select up to 32 branch addresses, 
as determined by the settings of B3-B0. 

Design Support 

Texas Instruments Regional Technology Centers, staffed with systems-oriented 
engineers, offer a training course to assist users of Tl's LSI products and their 
application to digital processor systems. Specific attention is given to the understanding 
and generation of design techniques which implement efficient algorithms designed 
to match high-performance hardware capabilities with desired performance levels. 
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Information on LSI devices and product support can be obtained from the following 
Regional Technology Centers: 



Atlanta 

Texas Instruments Incorporated 
3300 N.E. Expressway, Building 8 
Atlanta/ GA 30341 
404/662-7945 

Boston 

Texas Instruments Incorporated 
950 Winter Street, Suite 2800 
Waltham, MA 02154 
617/895-9100 

Northern California 

Texas Instruments Incorporated 

5353 Betsy Ross Drive 

Santa Clara, CA 95054 

408/748-2220 



Chicago 

Texas Instruments Incorporated 

51 5 Algonquin 

Arlington Heights, IL 60005 

312/640-2909 

Dallas 

Texas Instruments Incorporated 
10001 E. Campbell Road 
Richardson, TX 75081 
214/680-5066 

Southern California 

Texas Instruments Incorporated 

17891 Cartwright Drive 

Irvine, CA 92714 

714/660-8140 
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Design Expertise 

Texas Instruments can provide in-depth technical design assistance through 
consultations with contract design services. Contact the local Field Sales Engineer 
for current information or contact VLSI Systems Engineering at 214/997-3970. 
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Figure 1. ACT8818 ...... GC Package 
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Table 1. 'ACT8818 Pin Grid Allocation 





PIN 




PIN 




PIN 




PIN 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


A2 


RC2 


C2 


RCO 


F3 


RBOE 


J10 


S1 


A3 


Y1 


C3 


GND 


F9 


BO 


J11 


STKWRN/RER 


A4 


Y3 


C5 


GND 


F10 


B1 


K1 


DRBO 


A5 


Y5 


C6 


Y7 


F11 


MUX2 


K2 


SELDR 


A6 


Y6 


C7 


Y10 


G1 
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K5 
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A9 


Y13 
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RE 


G9 


GLK 


K6 


DRA7 


A10 


NC 


D1 


DRB12 


G10 


MUXO 


K7 


DRA5 


B1 


DRB15 


D2 


DRB13 


G11 


MUX1 


K8 


DRA3 


B2 


RC1 


D9 


GND 


H1 


DRB4 


K9 


DRAO 


B3 


YO 


D10 


COUT 


H2 


DRB3 


K10 


SO 


B4 


Y2 


D11 


INC 


H10 


CC 


K11 


S2 


B5 


Y4 


E1 


DRB9 


H11 


ZEROUT 


L2 


DRA15 


B6 


YOE 


E2 
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E3 
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E9 
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DRA1 


L8 
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C1 
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F2 
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J9 


GND 


L9 
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Table 2. ACT881 8 Pin Functional Description 



PIN 
NAME 


GC 
NO. 


FN 
NO. 


I/O 


DESCRIPTION 


BO 


F9 


22 






B1 


F10 


23 


I 


Input bits for branch addressing (see Table 3) 


B2 


E11 


24 






B3 


E10 


25 






CLK 


G9 


18 




System clock 










Incrementer carry-out. Goes high when an attempt is 


COUT 


D10 


28 


0 


made to increment microprogram counter beyond 
addressable micromemory. 


cc 


H10 


15 


I 


Condition code 


DRAO 


K9 


9 






DRA1 


J8 


8 






DRA2 


L9 


7 






DRA3 


K8 


6 






DRA4 


L8 


5 






DRA5 


K7 


4 






DRA6 


L7 


3 






DRA7 


K6 


2 


I/O 


Bidirectional DRA data port. Outputs data from 


DRA8 


L6 


84 




stack or register/counter A (RAOE = 0) or inputs 


DRA9 


L5 


83 




external data (RAOE = 1). 


DRA10 


K5 


82 






DRA11 


L4 


80 






DRA12 


K4 


79 






DRA13 


L3 


78 






DRA14 


K3 


77 






DRA15 


L2 


76 






DRBO 


K1 


73 






DRB1 


J2 


72 






DRB2 


J1 


71 






DRB3 


H2 


70 




Bidirectional DRB data port. Outputs data from 


DRB4 


H1 


69 


I/O 


register/counter B 


DRB5 


G2 


67 


(RBOE = 0) or inputs external data 


DRB6 


G1 


66 






DRB7 


FT 


65 






DRB8 


F2 


63 






DRB10 


E2 


61 
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Table 2. ACT8818 Pin Functional Description (Continued) 



PIN 
NAME 


GC 
NO 


FN 
NO. 


I/O 


DESCRIPTION 


DRB11 
DRB12 
DRB13 
DRB14 
DRB15 


E3 
D1 
D2 
C1 
B1 


60 
59 
58 
57 
56 


I/O 


Bidirectional DRB data port. Outputs data from 
register/counter B (RBOE = 0) or inputs external data 
(RBOE = 1). 


GND 
GND 
GND 
GND 
GND 
GND 
GND 


C3 
C5 
C9 
D9 
G3 
J 5 
J9 


10 
30 
33 
46 
52 
68 
81 




Ground pins. All pins must be used. 


INC 


DTI 


27 


I 


Incrementer control pin 


INT 


E9 


26 


I 


Selects INT RT register to stack, active low (see 
Table 3) 


MUXO 
MUX1 
MUX2 


G10 
G11 
F11 


19 
20 
21 


I 


MUX control for Y output bus (see Table 4) 


OSEL 


L10 


11 


I 


DRA output MUX select. Low selects RCA, high 
selects stack. 


RAOE 


J6 


1 


I 


DRA output enable, active low 


RBOE 


F3 


64 


I 


DRB output enable, active low 


RCO 
RC1 
RC2 


C2 
B2 
A2 


55 
54 
53 


I 


Controls for register/counters A and B 


RE 


C11 


29 


I 


INT RT register enable, active low. A high input holds 
INT RT register while a low input passes Y to INT RT 
register (see Table 3). 


SO 
S1 
S2 


K10 
J10 
K11 


12 
13 
14 


I 


Stack controls 


SELDR 


K2 


75 


I 


Selects data source to DRA bus and DRB bus (See 
Table 3) 


STKWRN/ 
RER 


J11 


16 


0 


Stack warning signal flag 


v C c 
v C c 


C10 
J3 


31 
74 




Supply voltage (5 V) 
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Table 2. ACT8818 Pin Functional Description (Concluded) 



00 



PIN 
NAME 


GC 
NO. 


FN 
NO. 


l/U 


DcbuH IPTION 


Y0 


B3 


51 






Y1 


A3 


50 






Y2 


B4 


49 






Y3 


A4 


48 






Y4 


B5 


47 






Y5 


A5 


45 






Y6 


A6 


44 






Y7 


C6 


43 


I/O 


Bidirectional Y data port 


Y8 


A7 


41 






Y9 


B7 


40 






Y10 


C7 


39 






Y11 


A8 


38 






Y12 


B8 


37 






Y13 


A9 


36 






Y14 


B9 


35 






Y15 


B10 


34 






YOE 


B6 


42 


I 


Y output enable, active low 


ZEROIN 


B11 


32 


I 


Forces internal zero detect high 


ZEROUT 


H11 


17 


0 


Outputs register/counter zero detect signal 
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'ACT8818 Specification Tables 



absolute maximum ratings over operating free air temperature range (unless 
otherwise noted) *!* 



Supply voltage, Vqc -0.5 V to 6 V 

Input clamp current, l||< (V|<0 or V|>Vcc) ■ ±20 mA 

Output clamp current, Iok (Vfj<0 or Vq>Vcc • ±50 mA PJI 

Continuous output current, \q (Vrj = 0 to Vqc) ±50 mA Q 

Continuous current through Vrjc or GND pins ±100 mA 

Operating free-air temperature range . . . . 0°C to 70 °C 2? 

Storage temperature range 65 °C to 150 °C g 

* Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. ^ 

These are stress ratings only and functional operation of the device at these or any other conditions beyond ^ 

those indicated under "recommended operating conditions" is not implied. Exposure to absolute maximum ^ 

rated conditions for extended periods may affect device reliability. P*** 

z 

recommended operating conditions CO 



PARAMETER 


MIN IMOM MAX 


UNIT 


Vfjc Supply voltage 


4.5 5 5.5 


V 


V|h High-level input voltage 


2 V CC 


V 


V|l Low-level input voltage 


0 0.8 


V 


'OH High-level output current 


-8 


mA 


IfjL Low-level output current 


8 


mA 


Vj Input voltage 


0 V CC 


V 


Vq Output voltage 


o v C c 


V 


dt/dv Input transition rise or fall rate 


0 15 


ns/V 


TA Operating free-air temperature 


0 70 


°C 
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electrical characteristics over recommended operating free-air temperature 
range (unless otherwise noted) 



CO 



00 



DADAMCTCD 

rARAMcTcn 


TtCT /T"4MrMT"I/"\MC 

Tcbl UUNUlliUIMo 


V CC 


TA - 25°C 


aaiKI TVD IIAV 
MIIM I TP MAA 


1 IMIT 
UIMI 1 


MIN TYP MAX 


V OH 


■OH = ~ 20 fi/\ 


4.5 V 


4.48 




V 


5.5 V 


5.46 




Iqh = -8 mA 


4.5 V 


4.15 


3.76 


5.5 V 


4.97 


4.76 




l 0L = 20 iiA 


4.5 V 


0.014 




V 


5.5 V 


0.014 




'OL 8 mA 


4.5 V 


0.15 


0.45 


5.5 V 


0.13 


0.45 


»l 


V, = V CC or 0 


5.5 V 




± 1 


(iA 


j cc 


V| = V cc or0 


5.5 V 


98 


200 


>A 


Ci 


V, = V CC or 0 


5 V 


3 




PF 




One input at 3.4 V, other 
inputs at 0 or Vqc 


5.5 V 




1 


mA 



This is the increase in supply current for each input that is at one of the specified TTL voltage levels rather 
than 0 V or Vqc- 
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maximum switching characteristics 



PARAMETER 


FROM 
(INPUT) 


TO 
(OUTPUT) 


UNIT 


Y ZEROUT DRA DRB STKWRN COUT 


tpd 


cc 


23 




CLK 


27 24 16 25 
30^ 23 f 




DRA1 5-DRAO 


23 


ns 


DRB1 5-DRBO 


22 


MUX2-MUXO 


22 


RC2-RCO 


26 18 


S2-S0 


25 19 


B3-B0 


19 


OSEL 


25 20 


ZEROIN 


25 


SELDR 


23 


INC 


20 


Y 


16 


ten 


YOE 


16 


ns 


RAOE 


18 


RBOE 


17 


*dis 


YOE 


14 


ns 


RAOE 


13 


RBOE 


14 



Decrementing register/counter A or B and sensing a zero. 
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setup and hold times 



Z 



00 



PARAMETER 


FROM (INPUT) 


TO (OUTPUT) 


MIN 


MAX 


UNIT 




CC 


Stack 


15 








Stack 


9 






DRA1 5-DRAO 


RCA 


6 








INT RT 


9 






DRR1 R-DRRO 


RCB 


7 






INT RT 


11 






INC 


MPC 


7 






INT 


Stack 


7 








Stack 


15 






RC2-RC0 


RCA, RCB 


6 








INT RT 


16 




L su 


S2-S0 


Stack 


13 




INT RT 


13 




UOCL 


Stack 


12 






INT RT 


13 






DO DU 


Stack 


8 






INT RT 


14 






SELDR 


Stack 


10 






INT RT 


10 






ZEROIN 


Stack 


14 






INT RT 


13 






Y 


MPC 


6 






RE 


INT RT (CLK) 


7 






MUX2-MUX0 


INT RT 


12 




th 


Any 
Input 


Any 
Destination 


0 


ns 



clock requirements 



PARAMETER 


MIN MAX 


UNIT 


t w -| Pulse duration, clock low 


7 


ns 


t w 2 Pulse duration, clock high 


9 


ns 


t c Clock cycle time 


33 


ns 
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Architecture 



The 'ACT8818 microsequencer is designed with a 3-port architecture similar to the 
bipolar SN74AS890 microsequencer. Figure 4 shows the architecture of the 
'ACT8818. The device consists of the following principal functional groups: 

1. A 16-bit microprogram counter (MPC) consisting of a register and 
incrementer which generates the next sequential microprogram address 

2. Two register/counters (RCA and RCB) for counting loops and iterations, 
storing branch addresses, or driving external devices 

3 . A 65-word by 1 6-bit LIFO stack which allows subroutine calls and interrupts 00 
at the microprogram level and is expandable and readable by external qq 
hardware 00 

4 . An interrupt return register and Y output enable for interrupt processing at O 

the microinstruction level ^ 

*T 

5. AY output multiplexer by which the next address can be selected from MPC, ^ 
RCA, RCB, external buses DRA and DRB, or the stack. ^ 

'ACT8818 control signals are summarized in Table 3. Those signals, which typically 
originate from the instruction register, are Y output multiplexer controls, MUX2-MUX0. 
These select the source of the next address; stack operation controls, S2-SO; 
register/counter operation controls, RC2-RC0; OSEL, which allows the stack to be 
read for di agnostics; input MUX select, SELDR; DRA and DRB output enables, RACE 
and RBOE; and INT, used during the first cycle of interrupt service routines to push 
the address in the interrupt return register address onto the stack. 

Control and data signals that commonly originate from the microinstruction and from 
other hardware sources include INC, which determines whether to increment the MPC; 
DRA and DRB, used to load or read loop counters and/or next addresses; and CC, 
the condition code input. The address being loaded into the MPC is not incremented 
if INC is low, allowing wait states and repeat until flag instructions to be implemented. 
If INC originates from status, repeat until flag instructions are possible. 

The condition code input CC typically originates from ALU status to permit test and 
branch instructions. However, it must also be asserted under microprogram control 
to implement other instructions such as continue or loop. Therefore, CC will generally 
be controlled by the output of a status multiplexer. In this case, whether CC is to 
be forced high, forced low or taken from ALU status will be determined by a status 
MUX select field in the microinstruction. 
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Table 3. Response to Control Inputs 



SIGNAL 


LOGIC LEVEL 


NAME 


HIGH 


LOW 


B0 t 


Load stack pointer from 7 least 
significant bits of DRA 


No effect 


Bit 


Selects DRA contents as stack 
input (takes priority over INT) 


No effect 


CC 


Condition code input. May be 
microcoded or selected from 
external status results. 


Condition code input. For branch 
operations, low active. 


INC 


Increment address from Y bus and 
load into MPC 


Pass address from Y bus to MPC 
unincremented. 


IIM 1 + 


Selects MPC as input to stack 


Selects interrupt return register as 
input to stack 


OSEL 


Selects stack as output from DRA 
output MUX 


Selects RCA as output from DRA 
output MUX 


MUX2-MUX0 


See Table 4 


See Table 4 


RAOE 


DRA output disabled (high-Z) 


DRA output enabled 


RBOE 


DRB output disabled (high-Z) 


DRB output enabled 


RC2-RC0 


See Table 6 


See Table 6 


RE 


Hold interrupt return register 
contents 


Load address on Y bus to interrupt 
return register 


S2-S0 


See Table 5 


See Table 5 


SELDR 


Selects DRA/DRB external data as 
inputs to DRA/DRB buses 


Selects RCA (OSEL low) or stack 
(OSEL high) to DRA bus, RCB to 
DRA bus 


YOE 


Y output disabled (high-Z) 


Y output enabled 


ZEROIN 


Sets ZERO to a high externally to 
set up conditional branch 


No effect 



tNo control effect when DRA' or DRB' selected (MUX2-MUX0) 
*When B1 is low or B1 is not in control mode. 



= HLH) because B3-B0 are address inputs. 



Control signals which may also originate from hardware are B3-B0, which can be used 
as a 4-bit status input to support 16- and 32- way branches, and YOE, which allows 
interrupt hardware to force an interrupt vector on the microaddress bus. 
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Figure 4. ACT8818 Functional Block Diagram 



Status from the 'ACT8818 is provided by ZEROUT, which is set at the beginning of 
a cycle in which either of the register/counters will decrement to zero, and 
STKWRN/RER, set at the beginning of the cycle in which the bottom of stack is read 
or in which the next to last location is written. In the latter case, STKWRN/RER remains 
high until the stack pointer is decremented from 64 to 63, 
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Y Output Multiplexer 



Address selection is controlled by the Y output multiplexer and the RAOE and RBOE 
enables. Addresses can be selected from eight sources: 

The microprogram counter register, used for repeat (INC off) and continue 
(INC on) instructions 

The stack, which supports subroutine calls and returns as well as iterative 
loops and returns from interrupts 

The DRA and DRB ports, which provide two additional paths from external 
hardware by which microprogram addresses can be generated 

Register counters RCA and RCB, which can be used for additional address 
storage 

B3-B0, whose contents can replace the four least significant bits of the 
DRA and DRB buses to support 1 6-way and 32-way branches 

An external input onto the bidirectional Y port to support external 
interrupts. 

Use of controls MUX2-MUX0 is explained further in the later section on 
microprogramming the 'ACT88 18. 

Microprogram Counter 

Based on system status and the current instruction, the microsequencer outputs the 
next execution address in the microprogram. Usually the incrementer adds one to the 
address on the Y bus to compute next address plus one. Next address plus one is 
stored in the microprogram register at the beginning of the subsequent instruction cycle. 
During the next instruction, this 'continue' address will be ready at the Y output MUX 
for possible selection as the source of the subsequent instruction. The incrementer 
thus looks two addresses ahead of the address in the instruction register to set up 
a continue (increment by one) or repeat (no increment) address. 

Selecting INC from status is a convenient means of implementing instructions that 
must repeat until some condition is satisfied; for example, Shift ALU Until MSB = 1 , 
or Decrement ALU Until Zero. The MPC is also the standard path to the stack. The 
next address is pushed onto the stack during a subroutine call, so that the subroutine 
will return to the instruction following that from which it was called. 

Register/Counters 

Addresses or loop counts may be loaded directly into register/counters RCA and RCB 
through the direct data ports DRA1 5-DRAO and DRB1 5-DRBO. The values stored in 
these registers may either be held, decremented, or read. Independent control of both 
the registers during a single cycle is supported with the exception of a simultaneous 
decrement of both registers. 
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Stack 

The positive edge clocked 1 6-bit address stack allows multiple levels of nested calls 
or interrupts and can be used to support branching and looping. Seven stack operations 
are possible: 

1 . Reset, which pulls all Y outputs low and clears the stack pointer and read 
pointer 

2 . Clear, which sets the stack pointer and read pointer to zero 

3 . Pop, which causes the stack pointer to be decremented 

4 . Push, which puts the contents of the MPC, interrupt return register, or 
DRA bus onto the stack and increments the stack pointer 

5 . Read, which makes the address indicated by the read pointer available 
at the DRA port 

6 . Hold, which causes the address of the stack and read pointers to remain 

unchanged 

7 . Load stack pointer, which inputs the seven least significant bits of DRA 

to the stack pointer. 

Stack Pointer 

The stack pointer (SP) operates as an up/down counter; it increments whenever a push 
occurs and decrements whenever a pop occurs. Although push and pop are two event 
operations (store then increment SP, or decrement SP then read), the 'ACT8818 
performs both events within a single cycle. 

Read Pointer 

The read pointer (RP) is provided as a tool for debugging microcoded systems. It permits 
a nondestructive, sequential read of the stack contents from the DRA port. This 
capability provides the user with a method of backtracking through the address 
sequence to determine the cause of overflow without affecting program flow, the status 
of the stack pointer, or the internal data of the stack. 

Stack Warning/Read Error Pin 

A high signal on the STKWRN/RER pin indicates a potential stack overflow or underflow 
condition. STKWRN/RER becomes active under two conditions. If 62 of the 65 stack 
locations (0-67) are full (the stack pointer is at 62) and a push occurs, the STKWRN/RER 
pin outputs a high signal to warn that the stack is approaching its capacity and will 
be full after two more pushes. 

The STKWRN/RER signal will remain high if hold, push or pop instructions occur, until 
the stack pointer is decremented to 63. If a push instruction is attempted when the 
stack is full, the new address will be ignored and the old address in stack location 
64 will be retained. 
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The second condition in which the STKWRN/RER signal goes high is to indicate that 
the last location has been popped from the stack and the stack is empty. The user 
may be protected from attempting to pop an empty stack by monitoring STKWRN/RER 
before pop operations. A high level at this pin signifies that the last address has been 
removed from the stack (SP = 0). This condition remains until an address is pushed 
onto the stack and the stack pointer is incremented to one. 

EM Interrupt Return Register 

^™ Unlike the MPC register, which normally gets next address plus one, the interrupt return 
(/) register simply gets next address. This permits interrupts to be serviced with zero 
2j latency, since the interrupt vector replaces the pending address. 

^ The interrupting hardware disables the Y output and forces the vector onto the 

q microaddress bus. This event must be synchr oniz ed with the system clock. The first 

H address of the service routine must program INT low and perform a push to put the 

qq contents of the interrupt return register on the stack. 

-a 

CD 
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Microprogramming the 'ACT8818 



Y 




DRA 








Y 




DRA' 








Y 




DRB 








Y 




DRB' 








Y 




STK 








Y 




DRA; STK - 


MPC; SP - 


SP 


+ 1 


Y 




DRB; STK - 


MPC; SP - 


SP 


+ 1 


Y 




DRA'; STK - 


MPC; SP - 


■ SP 


+ 1 


Y 




DRB'; STK - 


MPC; SP - 


SP 


+ 1 


Y 




STK; STK - 


MPC; SP - 


SP 


+ 1 



< 

Z 



Microprogramming is unlike programming monolithic processors for several reasons. 
First, the width of the microinstuction word is only partially constrained by the basic 
signals required to control the sequencer. Since the main advantage of a 
microprogrammed processor is speed, many operations are often supported by or 
carried out in special purpose hardware. Lookup tables, extra registers, address 
generators, elastic memories, and data acquisition circuits may also be controlled by 
the microinstruction. 

The number of slices in a bit-slice ALU is user-defined, which makes the microinstruction CO 
width even more application dependent. Types of instructions resulting from 
manipulation of the sequencer controls are discussed below. Examples of some 00 
commonly used instructions can be found in the later section of microinstructions and H 
flow diagrams. The following abbreviations are used in the tables in this section: 

BR A 
BR A' 
BR B 
BR B' 
BR S 
CALL A 
CALL B 
CALL A' 
CALL B' 
CALL S 

CLR SP, RP SP - 0; RP - 0 

CONT/RPT Y - MPC + 1 if INC = H; Y - MPC if INC = L 
DRA Bidirectional data port (can be loaded externally or from RCA) 

DRA15-DRA4::B3-B0 

Bidirectional data port (can be loaded externally or from RCB) 
DRB15-DRB4::B3-B0 
Microprogram counter 
SP - SP - 1 

STK - MPC; SP - SP + 1 
Register/counter A 
Register/counter B 

Y ~ STK; RP - RP 

Y - 0; SP - 0; RP - 
Read pointer 
Stack pointer 
Stack 



DRA' 

DRB 

DRB' 

MPC 

POP 

PUSH 

RCA 

RCB 

READ 

RESET 

RP 

SP 

STK 



1 

0 
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Address Selection 



c/> 

Z 

> 

O 
H 
00 
00 
_* 

00 



Y-output multiplexer controls MUX2-MUX0 select one of eight 3-source branches as 
shown in Table 4. The states of CC and ZERO determine which of the three sources 
is selected as the next address. ZERO is set at the beginning of any cycle in which 
a register/counter will decrement to zero. 

Table 4. Output Controls (MUX2-MUX0) 



MUX2- 
MUX0 


RESET 


Y OUTPUT SOURCE 


CC 


- L 


CC - H 


ZERO - L 


ZERO = H 


XXX 


Yes 


All Low 


All Low 


All Low 


LLL 


No 


STK 


MPC 


DRA 


LLH 


No 


STK 


MPC 


DRB 


LHL 


No 


STK 


DRA 


MPC 


LHH 


No 


STK 


DRB 


MPC 


HLL 


No 


DRA 


MPC 


DRB 


HLH 


No 


DRA't 


MPC 


DRB'* 


HHL 


No 


DRA 


STK 


MPC 


HHH 


No 


DRB 


STK 


MPC 



t DRA1 5-DRA4::B3-B0 
*DRB15-DRB4::B3-B0 



By programming CC high or low without decrementing registers, only one outcome 
is possible; thus, unconditional branches or continues can be implemented by forcing 
the condition code. Alternatively, GC can be selected from status, in which case Branch 
A on Condition Code Else Branch B instructions are possible, where A and B are the 
address sources determined by MUX2-MUX0. 

Decrement and Branch on Nonzero instructions, creating loops that repeat until a 
terminal count is reached, can be implemented by programming CC low and 
decrementing a register/counter. If CO is selected from status and registers are 
decremented, more complex instructions such as Exit on Condition Code or End or 
Loop are possible. 

When MUX2-MUX0 = HLH, the B3-B0 inputs can replace the four least significant 
bits of DRA or DRB to create 16-Way branches or, when CC is based on status, to 
create 32-way branches. 

Stack Controls 

As in the case of the MUX controls, each stack-control coding is a three-way choice 
based on CC and ZERO (see Table 5). This allows push, pop, or hold stack operations 
to occur in parallel with the aforementioned branches. A subroutine call is accomplished 
by combining a branch and push, while returns result from coding a branch to stack 
with a pop. 
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Table 5. Stack Controls (S2-S0) 







STACK OPERATION 


S2-S0 


OSEL 


CC 


L 


CC - H 






ZERO = L 


ZERO = H 


LLL 


X 


Reset/Clear 


Reset/Clear 


Reset/Clear 


LLn 


v 

A 


Clear SP/RP 


Hold 


Hold 


LHL 


X 


Hold 


Pop 


Pop 


LHH 


X 


Pop 


Hold 


Hold 


HLL 


X 


Hold 


Push 


Push 


HLH 


X 


Push 


Hold 


Hold 


HHL 


X 


Push 


Hold 


Push 


HHH 


H 


Read 


Read 


Read 


HHH 


L 


Hold 


Hold 


Hold 



Combining stack and MUX controls with status results and register decrements permits 
even greater complexity. For example: Return on Condition Code or End of Loop; Call A 
on Condition Code Else Branch to B; Decrement and Return on Nonzero; Call 1 6-Way. 

Diagnostic stack dumps are possible using Read (S2-S0 = HHH) when OSEL is set high. 

Register Controls 

Unlike stack and MUX controls, register control is not dependent upon CC and ZERO. 
Registers can be independently loaded, decremented, or held using register control 
inputs RC2-RC0 (see Table 6). All combinations are supported with the exception of 
simultaneous register decrements. The register control inputs can be set to store branch 
addresses and loop counts or to decrement loop counts, facilitating the complex 
branching instructions described above. 



Table 6. Register Controls (RC2-RC0) 



RC2-RC0 


REGISTER OPERATIONS 


REG A 


REG B 


LLL 


Hold 


Hold 


LLH 


Decrement 


Hold 


LHL 


Load 


Hold 


LHH 


Decrement 


Load 


HLL 


Load 


Load 


HLH 


Hold 


Decrement 


HHL 


Hold 


Load 


HHH 


Load 


Decrement 



The contents of RCA are accessible to the DRA port when OSEL is low and the output 
bus is en abled by RAOE being low. Data from RCB is available when DRB is enabled 
by RBOE being low. 
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Continue/Repeat Instructions 



c/> 
z 

> 

o 

H 
00 
00 
—a 

00 



The most commonly used instruction is a continue, implemented by selecting MPC 
at the Y output MUX and setting INC high. If MPC is selected and INC is off, the current 
instruction will simply be repeated. 

A repeat instruction can be implemented in two ways. A programmed repeat (INC 
I forced low) may be useful in generating wait states, for example, wait for interrupt. 
A conditional repeat (INC originates from status) may be useful in implementing Do 
While operations. Several bit patterns in the MUX control field of the microinstruction 
will place MPC on the microaddress bus. Continue/repeat instructions are summarized 
in Table 7 below. 



Table 7. Continue/Repeat Encodings 



MUX2-MUX0 


S2-S0 


OSEL 


cc = 


H 


LHL 


LLH 


X 


CONT/RPT 




LHL 


LHL 


X 


CONT/RPT: 


POP 


LHL 


HLL 


X 


CONT/RPT: 


PUSH 


LHL 


HHH 


0 


CONT/RPT 




LHL 


HHH 


1 


CONT/RPT: 


READ 


LHH 


LLH 


X 


CONT/RPT 




LHH 


LHL 


X 


CONT/RPT: 


POP 


LHH 


HLL 


X 


CONT/RPT: 


PUSH 


LHH 


HHH 


0 


CONT/RPT 




LHH 


HHH 


1 


CONT/RPT: 


READ 


HHL 


LLH 


X 


CONT/RPT 




HHL 


LHL 


X 


CONT/RPT: 


POP 


HHL 


LHH 


X 


CONT/RPT 




HHL 


HLL 


X 


CONT/RPT: 


PUSH 


HHL 


HHH 


0 


CONT/RPT 




HHL 


HHH 


1 


CONT/RPT: 


READ 


HHH 


LLH 


X 


CONT/RPT 




HHH 


LHL 


X 


CONT/RPT: 


POP 


HHH 


LHH 


X 


CONT/RPT 




HHH 


HLL 


X 


CONT/RPT: 


PUSH 


HHH 


HHH 


0 


CONT/RPT 




HHH 


HHH 


1 


CONT/RPT: 


READ 



Branch Instructions 

A branch or jump to a given microaddress can also be coded several ways. RCA, DRA, 
RCB, DRB, and STK are possible sources for branch addresses (see Table 4). Branches 
to register or stack are useful whenever the branch address could be stored to reduce 
overhead. 
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The simplest branches are to DRA and DRB> since they require only one cycle and 
the branch address is supplied in the microinstruction. Use of registers or stack requires 
an initial load cycle (which may be combined with a preceding instruction), but may 
be more practical when an entry point is referenced over and over throughout the 
microprogram, for example, in error-handling routines. Branches to stack or register 
also enhance sequencing techniques in which a branch address is dynamically 
computed or multiple branches to a common entry point are used, but the entry point 
varies according to the system state. In this case, the state change might require 
reloading the stack or register. 



00 



In order to force a branch to DRA or DRB, CC must be programmed high or low. A 

branch to stack is only possible when CC is forced low (see Table 4). 00 

00 

When CC is low, the ZERO flag is tested, and if a register decrements to zero the Jt 

branch will be transformed into a Decrement and Branch on Nonzero instruction. ^ 

Therefore, registers should not be decremented during branch instructions using ^ 

CC = 0 unless it is certain the register will not reach terminal count. Branch instructions ^ 

are summarized in Table 8, below. Call (Branch and Push MPC) instructions and Return ^ 
(Branch to Stack and Pop) instructions are discussed in later sections. 



Table 8. Branch Encodings 



MUX2-MUX0 


S2-S0 


OSEL 


CC - H 


LLL 


LLH 


X 


BR 


A 




LLL 


LHL 


X 


BR 


A: 


POP 


LLL 


HHH 


0 


BR 


A 




LLL 


HHH 


1 


BR 


A: 


READ 


LLH 


LLH 


X 


BR 


B 




LLH 


LHL 


X 


BR 


B: 


POP 


LLH 


HHH 


0 


BR 


B 




LLH 


HHH 


1 


BR 


B: 


READ 


HLL 


LLH 


X 


BR 


B 




HLL 


LHL 


X 


BR 


B: 


POP 


HLL 


LHH 


X 


BR 


B 




HLL 


HHH 


0 


BR 


B 




HLL 


HHH 


1 


BR 


B: 


READ 


HLH 


LLH 


X 


BR 


B' 


(16-way) 


HLH 


LHL 


X 


BR 


B' 


(16- way) : POP 


HLH 


LHH 


X 


BR 


B' 


(16-way) 


HLH 


HHH 


0 


BR 


B' 


(16-way) 


HLH 


HHH 


1 


BR 


B' 


(16-way): READ 


LLL 


LLH 


X 


BR 


S: 


CLR SP/RP 


LLL 


LHL 


X 


BR 


S 
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Table 8. Branch Encodings (Continued) 



00 
00 

00 



MUX2-MUX0 


S2-S0 


OSEL 


CC - H 


LLL 


HLL 


X 


BR S 




LLL 


HHH 


0 


BR S 




LLL 


HHH 


1 


BR S: 


READ 


LLH 


LLH 


X 


BR S: 


CLR SP/RP 


LLH 


LHL 


X 


BR S 




LLH 


HLL 


X 


BR S 




LLH 


HHH 


0 


BR S 




LLH 


HHH 


1 


BR S: 


READ 


LHL 


LLH 


X 


BR S: 


CLR SP/RP 


LHL 


LHL 


X 


BR S 




LHL 


HLL 


X 


BR S 




LHL 


HHH 


0 


BR S 




LHL 


HHH 


1 


BR S: 


READ 


LHH 


LLH 


X 


BR S: 


CLR SP/RP 


LHH 


LHL 


X 


BR S 




LHH 


HLL 


X 


BR S 




LHH 


HHH 


0 


BR S 




LHH 


HHH 


1 


BR S: 


READ 


HLL 


LLH 


X 


BR A: 


CLR SP/RP 


HLL 


LHL 


X 


BR A 




HLL 


LHH 


X 


BR A.- 


POP 


HLL 


HLL 


X 


BR A 




HLL 


HHH 


0 


BR A 




HLL 


HHH 


1 


BR A: 


READ 


HLH 


LLH 


X 


BR A' 


(16-way): CLR SP/RP 


HLH 


LHL 


X 


BR A' 


(16-way) 


HLH 


LHH 


X 


BR A' 


(16-way): POP 


HLH 


HLL 


X 


BR A' 


(16-way) 


HLH 


HHH 


0 


BR A' 


(16-way) 


HLH 


HHH 


1 


BR A' 


(16- way): READ 


HHL 


LLH 


X 


BR A: 


CLR SP/RP 


HHL 


LHL 


X 


BR A 




HHL 


LHH 


X 


BR A: 


POP 


HHL 


HLL 


X 


BR A 




HHL 


HHH 


0 


BR A 




HHL 


HHH 


1 


BR A: 


READ 
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Table 8. Branch Encodings (Concluded) 



MUX2-MUX0 


S2-S0 


OSEL 


CC - H 


HHH 


LLH 


X 


BR B: CLR SP/RP 


HHH 


LHL 


X 


BR B 


HHH 


LHH 


X 


BR B: POP 


HHH 


HLL 


X 


BR B 


HHK 


HHH 


0 


BR B 


HHH 


HHH 


1 


BR B: READ 



Conditional Branch Instructions 

Perhaps the most useful of all branches is the conditional branch. The 'ACT8818 
permits three modes of conditional branching: Branch on Condition Code; Branch 
1 6-Way from DRA or DRB; and Branch on Condition Code 16-Way from DRA Else 
Branch 1 6-Way from DRB. This increases the versatility of the system and the speed 
of processing status tests because both single-bit and 4-bit status are allowed. 

Testing single bit status is preferred when the status can be set up and selected through 
a status MUX prior to the conditional branch. Four-bit status allows the 'ACT881 8 
to process instructions based on Boolean status expressions, such as Branch if Overflow 
and Not Carry if Zero or if Negative. It also permits true n-way branches, such as If 
Negative then Branch to X, Else if Overflow, and Not Carry then Branch to Y. The 
tradeoff is speed versus program size. Since multiway branching occurs relatively 
infrequently in most programs, users will enjoy increased speed at a negligible cost. 
Conditional branching codes are listed in Table 9. Call (Branch and Push MPC) 
instructions and Return (Branch to Stack and Pop) instructions are discussed in later 
sections. 



Table 9. Conditional Branch Encodings 



MUX2- 
MUXO 


S2-S0 


OSEL 


CC -■ L 


CC - H 


LLL 


LLH 


X 


BR S: 


CLR SP/RP 


BR A 


LLL 


LHL 


X 


BR S 




BR A: POP 


LLL 


HLL 


X 


BR S 




CALL A 


LLL 


HHH 


0 


BR S 




BR A 


LLL 


HHH 


1 


BR S: 


READ 


BR A: READ 


LLH 


LLH 


X 


BR S: 


CLR SP/RP 


BR B 


LLH 


HHH 


0 


BR S 




BR B 


LLH 


LHL 


X 


BR S 




BR B: POP 


LLH 


HLL 


X 


BR S 




CALL B 


LLH 


HHH 


1 


BR S: 


READ 


BR B: READ 


LHL 


LLH 


X 


BR S: 


CLR SP/RP 


CONT/RPT 
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Table 9. Conditional Branch Encodings (Concluded) 



MUX2- 
MUXO 


S2-S0 


OSEL 


CC - L 




CC 


- H 


LHL 


LHL 


X 


BR S 




CONT/RPT: 


POP 


LHL 


HLL 


X 


BR S 




CONT/RPT: 


PUSH 


LHL 


HHH 


0 


BR S 




CONT/RPT 




LHL 


HHH 


1 


BR S: 


READ 


CONT/RPT: 


READ 


LHH 


LLH 


X 


BR S: 


ri R SP/RP 


CONT/RPT 




LHH 


LHL 


X 


BR S 




CONT/RPT: 


POP 


LHH 


HLL 


X 


BR S 




CONT/RPT: 


PUSH 


LHH 


HHH 


0 


BR S 




CONT/RPT 




LHH 


HHH 


1 


BR S: 


READ 


CONT/RPT: 


READ 


HLL 


LLH 


X 


BR A: 


CLR SP/RP 


BR B 






HLL 


LHL 


X 


BR A 




BR B: 


POP 




HLL 


LHH 


X 


BR A: 


POP 


BR B 






HLL 


HLL 


X 


BR A 




CALL 


B 




HLL 


HHH 


0 


BR A 




BR B 






HLL 


HHH 


1 


BR A: 


READ 


BR B.- 


READ 


HLH 


LLH 


X 


BR A' 


(16 .wav) 1 CLR SP/RP 


BR B' 


(16- way) 


HLH 


LHL 


X 


BRA' 


l i u way / 


BR B' 


(16- way): POP 


HLH 


LHH 


X 


BR A' 


(16-wavl* POP 


BR B' 


(16-way) 


HLH 


HLL 


X 


BR A' 


\ 1 \j way; 


CALL 


B' (16-way) 


HLH 


HHH 


0 


BR A' 


\ i vJ way / 


BR B' 


(16-way) 


HLH 


HHH 


1 


BR A' 


(16-wav)- READ 


BR B' 


(16-way): READ 


HHL 


LLH 


X 


BR A: 


CLR SP/RP 


CONT/RPT 




HHL 


LHL 


X 


BR A 




CONT/RPT: 


POP 


HHL 


LHH 


X 


BR A: 


POP 


CONT/RPT 




HHL 


HLL 


X 


BR A 




CONT/RPT: 


PUSH 


HHL 


HHH 


0 


BR A 




CONT/RPT 




HHL 


HHH 


1 


BR A: 


READ 


CONT/RPT: 


READ 


HHH 


LLH 


X 


BR B: 


CLR SP/RP 


CONT/RPT 




HHH 


LHL 


X 


BR B 




CONT/RPT: 


POP 


HHH 


LHH 


X 


BR B: 


POP 


CONT/RPT 




HHH 


HLL 


X 


BR B 




CONT/RPT: 


PUSH 


HHH 


HHH 


0 


BR B 




CONT/RPT 




HHH 


HHH 


1 


BR B: 


READ 


CONT/RPT: 


READ 
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Loop Instructions 



Up to two levels of nested loops are possible when both counters are used 
simultaneously. Loop count and levels of nesting can be increased by adding external 
counters if desired. The simplest and most widely used of the loop instructions is 
Decrement and Branch on Nonzero, in which CC is forced low while a register is 
decremented. As before, many forms are possible, since the top-of-loop address can 
originate from RCA, DRA, RGB, DRB, or the stack (see Table 4). Upon terminal count, 
instruction flow can either drop out of the bottom of the loop or branch elsewhere. 

When loops are used in conjunction with CC as status, B3-B0 as status and/or stack 00 

manipulation, many useful instructions are possible, including Decrement and Branch 

on Nonzero else Return, Decrement and Call on Nonzero, and Decrement and Branch 00 

1 6-Way on Nonzero. Possible variations are summarized in Table 1 0. Call (Branch and q 

Push MPC) instructions and Return (Branch to Stack and Pop) instructions are discussed ^ 

in later sections. *t 

_ ' ^ 
Another level of complexity is possible if CC is selected from status while looping. Z 

This type of loop will exit either because CC is true or because a terminal count has 

been reached. This makes it possible, for example, to search the ALU for a bit string. 

If the string is found, the match forces CC high. However, if no match is found, it 

is necessary to terminate the process when the entire word has been scanned. This 

complex process can then be implemented in a simple compact loop using Conditional 

Decrement and Branch on Nonzero. 
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Table 10. Decrement and Branch on Nonzero Encodings 



00 



MUX2- 
MUXO 


dC'dU 


UOCL 






CC 


= L 


UU ■» rl 


ZERO - L 


ZERO = H 


LLL 


LLH 


X 


BR 


S: 


CLRSP/RP 


CONT/RPT 


BR A 


LLL 


LHL 


X 


BR 


S 




CONT/RPT: POP 


BR A: POP 


LLL 


HLL 


X 


BR 


S 




CONT/RPT: PUSH 


CALL A 


LLL 


HHH 


0 


BR 


S 




CONT/RPT 


BR A 


LLL 


HHH 


1 


BR 


S: 


READ 


CONT/RPT: READ 


BR A 


LLH 


LLH 


X 


BR 


S: 


CLR SP/RP 


CONT/RPT 


BR B 


LLH 


LHL 


X 


BR 


S 




CONT/RPT: POP 


BR B: POP 


LLH 


HLL 


X 


BR 


S 




CONT/RPT: PUSH 


CALL B 


LLH 


HHH 


0 


BR 


S 




CONT/RPT 


BR B 


LLH 


HHH 


1 


BR 


S: 


READ 


CONT/RPT: READ 


BR B 


LHL 


LLH 


X 


BR 


S: 


CLR SP/RP 


BR A 


CONT/RPT 


LHL 


LHL 


X 


BR 


S 




BR A:. POP 


CONT/RPT: POP 


LHL 


HLL 


X 


BR 


S 




CALL A 


CONT/RPT: PUSH 


LHL 


HHH 


0 


BR 


S 




BR A 


CONT/RPT 


LHL 


HHH 


1 


BR 


S: 


READ 


BR A: READ 


CONT/RPT: READ 


LHH 


LLH 


X 


BR 


S: 


CLR SP/RP 


BR B 


CONT/RPT 


LHH 


LHL 


X 


BR 


S 




BR B: POP 


CONT/RPT: POP 


LHH 


HLL 


X 


BR 


S 




CALL B 


CONT/RPT: PUSH 


LHH 


HHH 


0 


BR 


S 




BR B 


CONT/RPT 


LHH 


HHH 


1 


BR 


S: 


READ 


BR B: READ 


CONT/RPT: READ 


HLL 


LLH 


X 


BR 


A: 


CLR SP/RP 


CONT/RPT 


BR B 


HLL 


LHL 


X 


BR 


A 




CONT/RPT: POP 


BR B: POP 


HLL 


LHH 


X 


BR 


A: 


POP 


CONT/RPT 


BR B 


HLL 


HLL 


X 


BR 


A 




CONT: PUSH 


CALL B 


HLL 


HHH 


0 


BR 


A 




CONT/RPT 


BR B 


HLL 


HHH 


1 


BR 


A: 


READ 


CONT/RPT: READ 


BR B: READ 


HLH 


LLH 


X 


BR 


A' 


(16-way): CLR SP/RP 


CONT/RPT 


BR B' (16-way) 


HLH 


LHL 


X 


BR 


A' 


(16-way) 


CONT/RPT: POP 


BR B' (16-way): POP 


HLH 


LHH 


X 


BR 


A' 


(16-way): POP 


CONT/RPT 


BR B' (16-way) 


HLH 


HLL 


X 


BR 


A' 


(16-way) 


CONT/RPT: PUSH 


CALL B'(16-way) 


HLH 


HHH 


0 


BR 


A' 


(16-way) 


CONT/RPT 


BR B' (16-way) 


HLH 


HHH 


1 


BR 


A' 


(16-way): READ 


CONT/RPT: READ 


BR B' (16-way): READ 


HHL 


LLH 


X 


BR 


A: 


CLR SP/RP 


BR S 


CONT/RPT 


HHL 


LHL 


x 


BR 


A 




RET 


CONT/RPT: POP 


HHL 


LHH 


X 


BR 


A: 


POP 


BR S 


CONT/RPT 


HHL 


HLL 


X 


BR 


A 




CALL S 


CONT/RPT: PUSH 


HHL 


HHH 


0 


BR 


A 




BR S 


CONT/RPT 


HHL 


HHH 


1 


BR 


A: 


READ 


BR S: READ 


CONT/RPT: READ 
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Table 10. Decrement and Branch on Nonzero Encodings (Concluded) 



MUX2- 


SE-SO 


OSEL 




CC 


- L 


CC - H 


MUXO 


ZERO - L 


ZERO « H 


HHH 


LLH 


X 


BR B: CLR SP/RP 




BR S 


CONT/RPT 


HHH 


LHL 


X 


BR B 




RET 


CONT/RPT: POP 


HHH 


LHH 


X 


BR B: POP 




BR S 


CONT/RPT 


HHH 


HLL 


X 


BR B 




CALL S 


CONT/RPT: PUSH 


HHH 


HHH 


0 


BR B 




BR S 


CONT/RPT 


HHH 


HHH 


1 


BR B: READ 




BR S: READ 


CONT/RPT: READ 



00 

■r 
00 
00 

h- 
o 
< 

z 



Subroutine Calls 

The various branch instructions described above can be merged with a push instruction 
to implement subroutine calls in a single cycle. Calls, conditional calls, and Decrement 
and Call on Nonzero are the most obvious. 

Since a push is conditional on CC and ZERO, many hybrid instructions are also possible, 
such as Call X on Condition Code Else Branch, or Decrement and Return on Nonzero 
Else Branch. Codes that cause subroutine calls are summarized in Tables 1 1 and 1 2. 



Table 11. Call Encodings without Register Decrements 



MUX2-MUX0 


S2-S0 


OSEL 


CC = L 


CC - H 


LLL 


HLH 


X 


CALLS 


BR A 


LLL 


HHL 


X 


CALLS 


CALL A 


LLH 


HLH 


X 


CALL S 


BR B 


LLH 


HHL 


X 


CALL S 


CALL B 


LHL 


HLH 


X 


CALL S 


CONT/RPT 


LHL 


HHL 


X 


CALL S 


CONT/RPT: PUSH 


LHH 


HLH 


X 


CALL S 


CONT/RPT 


LHH 


HHL 


X 


CALL S 


CONT/RPT: PUSH 


HLL 


HLH 


X 


CALL A 


BR B 


HLL 


HHL 


X 


CALL A 


CALL B 


HLH 


HLH 


X 


CALL A' (16-way) 


BR B' (16-way) 


HLH 


HHL 


X 


CALL A' (16-way) 


CALL B' (16-way) 


HHL 


HLH 


X 


CALL A 


CONT/RPT 


HHL 


HHL 


X 


CALL A 


CONT/RPT: PUSH 


HHH 


HLH 


X 


CALL B 


CONT/RPT 


HHH 


HHL 


X 


CALL B 


CONT/RPT: PUSH 
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Table 12. Call Encodings with Register Decrements 



MUX2- 




UOCL 




55 - 


L 


w# n 


MUXO 


ZERO - L 


ZERO - H 


LLL 


HLH 


X 


CALL S 




CONT/RPT 


BR A 


LLL 


HHL 


X 


CALL S 




CONT/RPT 


CALL A 


LLH 


HLH 


X 


CALL S 




CONT/RPT 


BR B 


LLH 


HHL 


X 


CALL S 




CONT/RPT 


CALL B 


LHL 


HLH 


X 


CALL S 




BR A 


CONT/RPT 


LHL 


HHL 


X 


CALL S 




BR A 


CONT/RPT: PUSH 


LHH 


HLH 


X 


CALL S 




BR B 


CONT/RPT 


LHH 


HHL 


X 


CALL S 




BR B 


CONT/RPT: PUSH 


HLL 


HLH 


X 


CALL A 




CONT/RPT 


BR B 


HLL 


HHL 


X 


CALL A 




CONT/RPT 


CALL B 


HLH 


HLH 


X 


CALL A' 


(16-way) 


CONT/RPT 


BR B' (16-way) 


HLH 


HHL 


X 


CALL A' 


(16-way) 


CONT/RPT 


CALL B' (16-way) 


HHL 


HLH 


X 


CALL A 




BR S 


CONT/RPT 


HHL 


HHL 


X 


CALL A 




BR S 


CONT/RPT: PUSH 


HHH 


HLH 


X 


CALL B 




BR S 


CONT/RPT 


HHH 


HHL 


X 


CALL B 




BR S 


CONT/RPT: PUSH 



z 

"si 

> 
O 
H 
00 
00 

00 



Subroutine Returns 

A return from subroutine can be implemented by coding a branch to stack with a pop. 
Since pop is also conditional on CC and ZERO, the complex forms discussed previously 
also apply to return instructions: Decrement and Return on Nonzero; Return on 
Condition Code; Branch on Condition Code Else Return. Return encodings are 
summarized in Tables 13 and 14. 



Table 13. Return Encodings without Register 
Decrements 



MUX2-MUX0 


S2-S0 


OSEL 


CC - L 


CC - H 


LLL 


LHH 


X 


RET 


BRA 


LLH 


LHH 


X 


RET 


BR B 


LHL 


LHH 


X 


RET 


CONT/RPT 


LHH 


LHH 


X 


RET 


CONT/RPT 
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Table 14. Return Encodings with Register Decrements 



MUX2-MUX0 


S2-S0 


OSEL 




CC 


= L 


CC - H 


ZERO - 


L 


ZERO H 


LLL 


LHH 


X 


RET 






CONT/RPT 


BRA 


LLH 


LHH 


X 


RET 






CONT/RPT 


BR B 


LHL 


LHH 


X 


RET 






BR A 


CONT/RPT 


LHH 


LHH 


X 


RET 






BR B 


CONT/RPT 



Reset 

Pulling the S2-S0 pins low clears the stack and read pointers, and zeroes the Y output 
multiplexer (See Table 5). 

Clear Pointers 

The stack and read pointers may be cleared without affecting the Y output multiplexer 
by setting S2-S0 to LLH and forcing CC low (see Table 5). 

Read Stack 

Placing a high value on all of the stack inputs (S2-S0) and OSEL places the ' ACT88 1 8 
into the read mode. At each low-to-high clock transition, the address pointed to by 
the read pointer is available at the DRA port and the read pointer is decremented. The 
bottom of the stack is detected by monitoring the stack warning/read error pin 
(STKWRN/RER). A high appears on the STKWRN/RER output when the stack contains 
one word and a read instruction is applied to the S2*S0 pins. This signifies that the 
last address has been read. 

The stack pointer and stack contents are unaffected by the read operation. Under 
normal push and pop operations, the read pointer is updated with the stack pointer 
and contains identical information. 

Interrupts 

Real-time vectored interrupt routines are supported for those applications where polling 
would impede system throughput. Any instruction, including pushes and pops, may 
be interrupted. To process an interrupt, the following procedure should be followed: 

1 . Place the bidirectional Y bus into a high-impedance state by forcing YOE high. 

2 . Force the interrupt entry point vector onto the Y bus. INC should be high. 

3 . Push the current value in the Interrupt Return register on the stack as the 
execution address to return to when interrupt handling is complete. 

The first instruction of the interrupt routine must push the address stored in the interrupt 
return register onto the stack so that proper return linkage is maintained. This is 
accomplished by setting INT and B1 low and coding a push on the stack. 
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Sample Microinstructions for the 'ACT8818 

Representative examples of instructions using the 'ACT8818 are given below. The 
examples assume a one-level pipeline system, in which the address and contents of 
the next instruction are being fetched while the current instruction is being executed, 
and an ALU status register contains the status results of the previous instruction. 

I Since the incrementer looks two addresses ahead of the address in the instruction 
register to set up some instructions such as continue or repeat, a set-up instruction 
has been included with each example. This shows the required state of both INC and 
C/> CC. CC must be set up early because the status register on which Y-output selection 
is typically based contains the results of the previous instruction. 

±> 

^ Flow diagrams and suggested code for the sample microinstructions are also given 
O below. Numbers inside the circles are microword address locations expressed as 
hexadecimal numbers. Fields in microinstructions are binary numbers except for inputs 
00 on DRA or DRB, which are also in hexadecimal. For a discussion of sequencing 



00 



instructions, see the preceding section on microprogramming. 
Continue 

To Continue (Instruction 10), this example uses an instruction in Table 7 with 
CONT/RPT in the instruction column and no stack operation. INC and CC must be 
programmed high one cycle ahead of instruction 10 for pipelining. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 

(Set-up) XXX XXX XXX X 1 1 XXXX XXXX 

10 Continue 110 111 XXX 0 X X XXXX XXXX 

Continue and Pop 

To Continue and decrement the stack pointer (Pop), this example uses an instruction 
in Table 7 with CONT/RPT: POP in the instruction column. INC and CC are forced 
high in the previous instruction. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 

(Set-up) XXX XXX XXX X 1 1 XXXX XXXX 

10 Continue/Pop 110 010 XXX X X X XXXX XXXX 

Continue and Push 

To Continue and push the microprogram counter onto the stack (Push), this example 
uses an instruction in Table 7 with CONT/RPT: PUSH in the instruction column. INC 
and CC are forced high one cycle ahead of Instruction 10 for pipelining. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 

(Set-up) XXX XXX XXX X 1 1 XXXX XXXX 

10 Continue/Push 110 100 XXX 0 X X XXXX XXXX 
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Branch (Example 1) 

To Branch from address 10 to address 20, this example uses a BR A instruction from 
the CC - H column of Table 8. CC must be programmed high one cycle ahead of 
Instruction 10 for pipelining. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CU INC DRA DRB 

XXX XXX XXX X 1 X XXXX XXXX 

BR A 000 1 1 1 XXX 0 X X 0020 XXXX 

09 Branch (Example 2) 

To Branch from address 1 0 to address 20, this example uses a BR A instruction from 

^ the CC = L column of Table 8. CC is programmed low in the previous instruction; 

O as a result, a ZERO test follows the condition code test in Instruction 10. To ensure 

Jjj that a ZERO = H condition will not occur, registers should not be decremented during 

00 this instruction. 
— * 

00 Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 

(Set-up) XXX XXX XXX X 0 X XXXX XXXX 

10 BRA 110 111 000 0 X X 0020 XXXX 

Sixteen-Way Branch 

To Branch 1 6-Way, this example uses a BR B' instruction in Table 8. CC is 
programmed high in the previous instruction. The branch address is derived from the 
concatenation DRB1 5-DRB4::B3-B0. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 

(Set-up) XXX XXX XXX X 1 X XXXX XXXX 

10 BR B' 101 111 XXX 0 X X XXXX 0040 
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DRA-20 




CC-1 



IMPOSSIBLE 



Y-DRA 



Figure 8. Branch Example 1 



CC-0 



DRA-2Q 




IMPOSSIBLE 



00 
r- 

00 

ZERO»?">-^ IMPOSSIBLE* -j9 

O 
< 

Z 



Y-DRA 



© 



*no register decrement 

Figure 9. Branch Example 2 



© 



« — 


DRB-40 




r 






Y—DRB' 




[40j Hi) • • • I4F) 
Figure 10. Sixteen-Way Branch 
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Conditional Branch 



z 

> 
O 
H 
00 
00 

00 



To Branch to address 20 Else Continue to address 1 3, this example uses the first 
instruction from Table 9 with BR A in the CC = L column and CONT/RPT in the 
CO = H column. INC is set high in the preceding instruction to set up the Continue. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



(Set-up) 
10 



BR A else 
Continue 



XXX 
110 



XXX 

111 



XXX 
000 



X 
0 



1 

X 



xxxx xxxx 

0020 XXXX 



Three-Way Branch 

To Branch 3-Way, this example uses an instruction from Table 10 with BR A in the 
ZERO = L column, CONT/RPT in the ZERO = H column and BR B in the CC = H 
column. To enable the ZERO = H path, register A must decrement to zero during this 
instruction (see Table 6 for possible register operations). INC is programmed high in 
Instruction 10 to set up the Continue. 



Address 

(Set-up) 
10 



Instruction 



MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



11 



XXX XXX 



1 XXXX XXXX 



1 xxxx xxxx 



X 0020 0030 



Continue and 

Load Reg A 110 111 010 0 

Decrement Reg A; 
Branch 3- Way 100 1 1 1 001 0 

t Selected from external status 

Thirty-Two-Way Branch 

To Branch 32-Way, this example uses an instruction from Table 9 with BR A' in the 
CC = L column and BR B' in the CC = H column. The four least significant bits of 
the DRA' and DRB' addresses must be input at the B3-B0 port; these are concatenated 
with the 1 2 most significant bits of DRA and DRB to provide new addresses DRA' 
(DRA1 5-DRA4::B3-B0) and DRB' (DRB1 5-DRB4::B3-B0). 



Address 



Instruction 



MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



(Set-up) 

10 32-way Branch 



XXX 
101 



XXX 

111 



XXX 
000 



X 
0 



1 

X 



1 

X 



XXXX xxxx 
0040 0030 
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INC-1 



DRA-20 




IMPOSSIBLE* 



Y-MPC + 1 



Y-DRA 



*no register decrement 

Figure 11. Conditional Branch 



DRB 


-40 






DRA-30 


i 


« — 

f 




H 


^ CC 




L 






-? ^ 


ZERO-?^ 


■'1 


t 








Y — DRB' 






Y-DRA' 




1401 (4ll« • •14F1I30J l31J«»«f3F 



INC-1 




CC-1 





INC-1 



RCA -Count 



Y-MPC + 1 



5 



CO 



DRA-20 






» 


4 




© (s) (2) 



Figure 12. Three-Way Branch 



*no register decrement 

Figure 13. Thirty-Two-Way Branch 
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Repeat 



To Repeat (Instruction 10), this example uses an instruction in Table 7 with CONT/RPT 
in the instruction column. INC must be programmed low and CC high one cycle ahead 
of Instruction 10 for pipelining. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 

2 (Set-up) XXX XXX XXX X 1 0 XXXX XXXX 

10 Continue 110 111 XXX 0X1 XXXX XXXX 

c/> Repeat on Stack 
2 

^J- To Continue and push the microprogram counter onto the stack (Push), this example 
^ uses an instruction in Table 7 with CONT/RPT: PUSH in the instruction column. INC 
O and CC must be forced high one cycle ahead for pipelining. 
H 

00 To Repeat (Instruction 1 2), an BR S instruction from the ZERO = L column of Table 8 
2 is used. To avoid a ZERO = H condition, registers are not decremented during this 
00 instruction (see Table 6 for possible register operations). CC and INC are 
programmed high in Instruction 1 2 to set up the Continue in Instruction 1 1 . 



Address 


Instruction 


MUX2-MUX0 


S2-S0 


R2-R0 


OSEL 


CC 


INC 


DRA 


DRB 


(Set-up) 




XXX 


XXX 


XXX 


X 


1 


1 


XXXX 


XXXX 


10 


Continue/Push 


110 


100 


XXX 


X 


1 


1 


XXXX 


XXXX 


11 


Continue 


110 


111 


XXX 


0 


0 


X 


XXXX 


XXXX 


12 


BR Stack 


010 


111 


000 


0 


1 


X 


XXXX 


XXXX 




Y-MPC 



Figure 14. Repeat 
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10 



CC-? 



IMPOSSIBLE 



IIMC-1 



PUSH 



CC~1 



Y-MPC+1 




IMPOSSIBLE 



Y-MPC+1 



CC-0 




IMPOSSIBLE 



IMPOSSIBLE* 



CC-1 



Stack 



*no register decrement 

Figure 15. Repeat on Stack 



00 

00 
00 
h- 
O 
< 

2 

■CO 
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Repeat Until CC = H 



Z 

> 

O 
H 
00 
00 

00 



To Continue and push the microprogram counter onto the stack (Push), this example 
uses an instruction in Table 7 with CONT/RPT: PUSH in the instruction column. INC 
and CC must be forced high one cycle ahead for pipelining. 

To Repeat Until CC = H (Instruction 12), an instruction from Table 9 with BR S 
in the CC = L column and CONT/RPT: POP in the CC = H column is used. To avoid 
a ZERO = H condition, registers are not decremented (See Table 6 for possible register 
operations). CC and INC are programmed high in Instruction 12 to set up the 
Continue in Instruction 1 1 . A consequence of this is that the instruction following 1 3 
cannot be conditional. 



Address 

(Set-up) 
10 
11 
12 



Instruction 

Continue/Push 
Continue 

BR Stack else 
Continue 



MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



XXX 
110 
110 

010 



XXX 
100 

111 
111 



XXX 
XXX 
XXX 

000 



X 
X 
0 



xxxx xxxx 
xxxx xxxx 
xxxx xxxx 



1 xxxx xxxx 



"^Selected from external status 

Loop Until Zero 

To Continue and push the microprogram counter onto the stack (Push), this example 
uses an instruction in Table 7 with CONT/RPT: PUSH in the instruction column. INC 
and CC are forced high one cycle ahead for pipelining. Register A is loaded with 
the loop counter using a Load A instruction from Table 6. 

To decrement the loop count, a decrement register A and hold register B instruction 
from Table 6 is used. To Repeat Else Continue and Pop (decrement the stack pointer), 
an instruction from Table 9 with BR S in the ZERO = L column and CONT/RPT: POP 
in the ZERO = H column is used. CC is programmed low in Instruction 1 1 to 
force the ZERO test in Instruction 1 2; it is programmed high in Instruction 1 2 to set 
up the Continue in Instruction 11. 



Address 

(Set-up) 
10 
11 



12 



Instruction 

Continue/Push 
Continue/Load 
Reg A 
Decrement Reg A; 
BR S eise 
Continue: Pop 



MUX2-MUX0 S2-S0 R2^R0 OSEL CC INC DRA DRB 



XXX 
110 

110 



000 



XXX 
100 



XXX 
XXX 



X 
0 



xxxx xxxx 
xxxx xxxx 



111 010 



010 001 



0 1 xxxx xxxx 



1 xxxx xxxx 
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Conditional Loop Until Zero 



Two examples of a Conditional Loop on Stack with Exit are presented below. Both 
use the microcode shown below to branch to the stack on nonzero, continue and pop 
on zero, and branch to DRA with a pop if CC = H. In the first example, the value 
on the DRA bus is the same as the value in the microprogram counter, making the 
exit destinations on the CC and ZERO tests the same. In the second, the values are 
EH different, generating a two-way exit. 

^™ To Continue and push the microprogram counter onto the stack (Push), this example 
C/) uses an instruction in Table 7 with CONT/RPT: PUSH in the instruction column. INC 
^ must be high. CC is forced high in the preceding instruction for pipelining. 

^ To Continue (Instruction 11), this example uses an instruction in Table 7 with 
O CONT/RPT in the instruction column. INC must be high. CC must be programmed 
H high in the previous instruction. INC is programmed high to set up the Continue in 
§q Instruction 12. 

CO To Decrement and Branch else Exit (Instruction 1 2), an instruction from Table 1 0 with 
BR S in the ZERO = L column, CONT/RPT: POP in the ZERO = H column and BR A: POP 
in the CC = H column is used. 



Example 1: 



Address 


Instruction 


MUX2-MUX0 


S2-S0 


R2-R0 


OSEL 


CC 


INC 


DRA 


DRB 


(Set-up) 




XXX 


XXX 


XXX 


X 


1 


1 


XXXX 


XXXX 


10 


Continue/Push 


110 


111 


010 


0 


1 


1 


XXXX 


XXXX 




Load Reg A 


















11 


Continue 


110 


111 


XXX 


0 


t 


1 


XXXX 


XXXX 


12 


Decrement Reg A; 




















BR S else 




















Continue: Pop 




















else BR A: Pop 


000 


010 


001 


X 


X 


1 


0013 


XXXX 



* Selected from external status 



Example 2: 



Address 


Instruction 


MUX2-MUX0 


S2-S0 


R2-R0 


OSEL 


CC 


INC 


DRA 


DRB 


(Set-up) 




XXX 


XXX 


XXX 


X 


1 


1 


XXXX 


XXXX 


10 


Continue/Push 


110 


111 


010 


0 


1 


1 


XXXX 


XXXX 




Load Reg A 


















11 


Continue 


110 


111 


XXX 


0 


t 


1 


XXXX 


XXXX 


12 


Decrement Reg A; 




















BR S else 




















Continue: Pop 




















else BR A: Pop 


000 


010 


001 


X 


X 


X 


0025 


XXXX 



t Selected from external status 
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INC-1 


► 


« 


CC-1 










f 






IMPOSSIBLE 



RCA- 
Count 




H 


PUSH 


► 


« 






INC-1 


CC-1 


► 


4— 




IMPOSSIBLE 



INC-1 




H 


► 






Y-MPC+1 



DRA-25 


— *k — 


RCA — 
RCA-1 





INC-1 



INC-1 




Y-Stack 



INC-1 



Y-MPC+1 



POP 
SP-SP-1 



Y-DRA 



00 
i— 
00 
00 
H 

o 
< 

2 



Figure 18. Conditional Loop Until Zero (Example 2) 
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Jump to Subroutine 

To Call a Subroutine at address 30, this example uses the instruction from Table 1 1 
with CALL A in the CC = H column. CC is programmed high in the previous 
instruction. 



Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



(Set-up) 
10 



Call A 



XXX 
000 



XXX 
110 



XXX 
XXX 



X 
X 



1 

X 



xxxx xxxx 

0030 XXXX 



2 

> 
o 

H 
00 
CO 
— k 

00 



Conditional Jump to Subroutine 



To conditionally Call a Subroutine at address 20, this example uses an instruction from 
Table 1 1_with CALL A in the CC = L column and CONT/RPT in the CC = H 
column. CC is generated by external status during the preceding instruction. INC is 
programmed high in the preceding instruction to set up the Continue. To avoid a 
ZERO = H condition, registers should not be decremented during Instruction 10. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



(Set-up) XXX XXX XXX X 

10 Call A else 

Continue 110 101 000 X 

* Selected from external status 



1 XXXX xxxx 
1 0020 XXXX 



Two-Way Jump to Subroutine 

To perform a Two-Way Call to Subroutine at address 20 or address 30, this example 
uses an instruction from Table 11 with CALL A in the CC = L column and CALL B 
in the CC = H column. In this example, CC is generated by external status during 
the preceding (set-up) instruction. INC is programmed high in the preceding instruction 
to set up the Push. To avoid a ZERO = H condition, registers should not be decremented 
during Instruction 10. 



Address Instruction 

(Set-up) 

23 Call A else 
Call B 

* Selected from external status 



MUX2-MUX0 S2-S0 R2-R0 OSEL CC 
XXX XXX XXX X t 



100 



110 000 



INC DRA DRB 
1 XXXX XXXX 

X 0020 0030 
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INC-1 




CC-1 






V 




DRA-30 






p 





IMPOSSIBLE 



1 


H 

4 


PUSH 




f 






Y-DRA 





® 



Figure 19. Jump to Subroutine 



INC - 1 



DRA-20 




-IMPOSSIBLE* 00 



Y-MPC+1 



PUSH 



Y-DRA 



~@ © 



oo 

00 

o 
< 

z 



*no register decrement 

Figure 20. Conditional Jump to Subroutine 



DRA-20 


|L _ 


DRB-30 






IMPOSSIBLE* 



PUSH 




L 


» 

i 




Y-DRA 



© 



PUSH 



* no register decrement 

Figure 21. Two- Way Jump to Subroutine 
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Return from Subroutine 



To Return from a subroutine, this example uses an instruction from Table 1 3 with RET 
in the CC = L column. CC is programmed low in the previous instruction. To 
avoid a ZERO = H condition, registers are not decremented during Instruction 23. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 



(Set-up) 
23 



Return 



XXX 
010 



XXX 
011 



XXX 
000 



X 
X 



0 
0 



X 
X 



xxxx xxxx 
xxxx xxxx 



0> 
2 

> 

o 

H 
00 
00 

mmJk 

00 



Conditional Return from Subroutine 



To conditionally Return from a Subroutine, this example uses an instruction from 
Table 13 with RET in the CC = L column and CONT/RPT in the CC = H column. 
CC is selected from external status in the previous instruction/To avoid a ZERO = H 
condition, registers are not decremented during Instruction 23. 



Address Instruction 

(Set-up) 

23 Return else 
Continue 
* Selected from external status 



MUX2-MUX0 S2-S0 R2-R0 OSEL CC 
XXX XXX XXX X t 



010 



011 



000 



X 



1 



INC DRA ORB 
1 XXXX XXXX 

X XXXX XXXX 



Clear Pointers 

To Continue (Instruction 10), this example uses an instruction in Table 7 with 
CONT/RPT in the instruction column. INC must be high; CC must be programmed 
high in the previous instruction. To Clear the Stack and Read Pointers and Branch to 
address 40 (instruction 1 1), this example uses a BR A: Clear SP, RP instruction in 
Table 8. CC is programmed low in instruction 10 to set up the Branch. To avoid 
a ZERO = H condition, registers are not decremented during Instruction 11. 



Address 


Instruction 


MUX2-MUX0 


S2-S0 


R2-R0 


OSEL 


CC 


INC 


DRA 


(Set-up) 




XXX 


XXX 


XXX 


X 


1 


1 


XXXX 


10 


Continue 


110 


111 


000 


0 


0 


X 


0020 


11 


BR A and Clear 


















SP/RP 


110 


001 


000 


X 


X 


X 


XXXX 



Reset 

To Reset the 'ACT881 8, pull the S2-S0 pins low. This clears the stack and read pointers 
and places the Y bus into a low state. 

Address Instruction MUX2-MUX0 S2-S0 R2-R0 OSEL CC INC DRA DRB 
10 Reset XXX 000 XXX X X X XXXX XXXX 
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*no register decrement *no register decrement 

Figure 22. Return from Subroutine Figure 23. Conditional 

Return from Subroutine 
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2 

> 

O 
H 
00 
00 

CO 



INC-1 






CC-1 




4— 




IMPOSSIBLE 



1 


H 

4 

f 


CC-0 




Y-MPC + 1 




IMPOSSIBLE 



IMPOSSIBLE* 



*no register decrement 

Figure 24. Clear Pointers 
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SN74ACT8818 16-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 



SN74ACT8836 32- x 32-Bit Parallel Multiplier 




SN74ACT8837 64-Bit Floating Point Processor 



SN74ACT8841 Digital Crossbar Switch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 




Support 



Mechanical Data 



3-1 



3 



C/> 
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SN74ACT8832 
CMOS 32 Bit Registered ALU 

• 50-ns Cycle Time 

• Low-Power EPIC™ CMOS 

• Three-Port I/O Architecture 

• 64-Word by 36-Bit Register File 

• Simultaneous ALU and Register Operations 

• Configurable as Quad 8-Bit or Dual 16-Bit Single <v) 

Instruction, Multiple Data Machine 00 

00 

• Parity Generation/Checking £j 

< 

The SN74ACT8832 is a 32-bit registered ALU that can operate at 20 MHz and ^ 
20 MIPS (million instructions per second). Most instructions can be performed jjjjjj 
in a single cycle. The 'ACT8832 was designed for applications that require high- (/) 
speed logical, arithmetic, and shift operations and bit/byte manipulations. 

The 'ACT8832 can act as host CPU or can accelerate a host microprocessor. 
In high-performance graphics systems, the 'ACT8832 generates display-list 
memory addresses and controls the display buffer. In I/O controller applications, 
the 'ACT8832 performs high-speed comparisons to initialize and end data 
transfers. 

A three-operand, 64-word by 36-bit register file allows the 'ACT8836 to create 
an instruction and store the previous result in a single cycle. 



EPIC is a trademark of Texas Instruments Incorporated. 
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Introduction 

The SIM74ACT8832 Registered Arithmetic/Logic Unit (ALU) holds a primary position 
in the Texas Instruments family of innovative 32-bit LSI devices. Compatible with the 
SN74AS888 architecture and instruction set, the 'ACT8832 performs as a high-speed 
microprogrammable 32-bit registered ALU which can also be configured to operate 
as two 1 6-bit ALUs or four 8-bit ALUs in single-instruction, multiple-data (SIMD) mode. 

Besides introducing the 'ACT8832, this section discusses basic concepts of 
microprogrammed architecture and the support tools available for system development. 
Details of the 'ACT8832 architecture and instruction set are presented. Pin descriptions 
and assignments for the 'ACT8832 are also presented. 

Understanding Microprogrammed Architecture £j 

oo 

Figure 1 shows a simple microprogrammed system. The three basic components are oo 
an arithmetic/logic unit, a microsequencer, and a memory. The program that resides £t 
in this memory is commonly called the microprogram, while the memory itself is referred ^ 
to as a micromemory or control store. The ALU performs all the required operations 
on data brought in from the external environment (main memory or peripherals, for ^ 
example). The sequencer is dedicated to generating the next micromemory address yy 
from which a microinstruction is to be fetched. The sequencer and the ALU operate 
in parallel so that data processing and next-address generation are carried out 
concurrently. 

The microprogram instruction, or microinstruction, consists of control information to 
the ALU and the sequencer. The microinstruction consists of a number of fields of 
code that directly access and control the ALU, registers, bus transceivers, multiplexers, 
and other system components. This high degree of programmability in a parallel 
architecture offers greater speed and flexibility than a typical microprocessor, although 
the microinstruction serves the same purpose as a microprocessor opcode: it specifies 
control information by which the user is able to implement desired data processing 
operations in a specified sequence. The microinstruction cycle is synchronized to a 
system clock by latching the instruction in the microinstruction, or pipeline, register 
once for each clock cycle. Status results are collected in a status register which the 
sequencer samples to produce conditional branches within the microprogram. 

ACT8832 Registered ALU 

This device comprises a 32-bit ALU, a 64-word by 36-bit register file, two shifters 
to support double-precision arithmetic, and three independent bidirectional data ports. 

The 'ACT8832 is engineered to support high-speed, high-level operations. The ALU's 
13 basic arithmetic and logic instructions can be combined with a single- or double- 
precision shift operation in one instruction cycle. Other instructions support data 
conversions, bit and byte operations, and other specialized functions. 
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Figure 1 . Microprogrammed System Block Diagram 



The configuration of this processor enchances processing throughput in arithmetic 
and radix conversion. Internal generation and testing of status results in fast processing 
of division and multiplication algorithms. This decision logic is transparent to the user; 
the reduced overhead assures shorter microprograms, reduced hardware complexity, 
and shorter software development time. 

Support Tools 

Texas Instruments has designed a family of low-cost, real-time evaluation modules 
(EVM) to aid with initial hardware and microcode design. Each EVM is a small self- 
contained system which provides a convenient means to test and debug simple 
microcode, allowing software and hardware evaluation of components and their 
operation. 

At present, the 74AS-EVM-8 Bit-Slice Evaluation Module has been completed, and 
1 6- and 32-bit EVMs are in advanced stages of development. EVMs and support tools 
for other devices in the 'AS8800 family are also planned for future development. 
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Design Support 



Texas Instruments Regional Technology Centers, staffed with systems-oriented 
engineers, offer a training course to assist users of Tl's LSI products and their 
application to digital processor systems. Specific attention is given to the understanding 
and generation of design techniques which implement efficient algorithms designed 
to match high-performance hardware capabilities with desired performance levels. 

Information on LSI devices and product support can be obtained from the following 
Regional Technology Centers: 



Atlanta 

Texas Instruments Incorporated 
3300 N.E. Expressway, Building 8 
Atlanta, GA 30341 
404/662-7945 

Boston 

Texas Instruments Incorporated 
950 Winter St. Suite 2800 
Waltham, MA 02154 
617/895-9100 

Northern California 

Texas Instruments Incorporated 

5353 Betsy Ross Drive 

Santa Clara, CA 95054 

408/748-2220 



Chicago 

Texas Instruments Incorporated 

51 5 Algonquin 

Arlington Heights, IL 60005 

312/640-2909 

Dallas 

Texas Instruments Incorporated 
10001 E. Campbell Road 
Richardson, TX 75081 
214/680-5066 

Southern California 

Texas Instruments Incorporated 

17891 Cartwright Drive 

Irvine, CA 92714 

714/660-8140 
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Design Expertise 

Texas Instruments can provide in-depth technical design assistance through 
consultations with contract design services. Contact your local Field Sales Engineer 
for current information or contact VLSI Systems Engineering at 214/997-3970. 
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'ACT8832 Pin Descriptions 

Pin descriptions and grid allocations for the 'ACT8832 are given on the following pages. 
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Table 1. SN74ACT8832 Pin Grid Allocation 
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Table 2. SIM74ACT8832 Pin Description 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


AO R7 
A1 P7 
A2 T6 
A3 S6 
A4 R6 
A5 P6 


I 


Register file A port read address select 


BO T12 
B1 R10 
B2 S11 
B3 T11 
B4 S10 
B5 T10 


I 


Register file B port read address select 


BYOFO B2 
BYOF1 A4 
BYOF2 D13 
BYOF3 C17 


0 


Status signals indicate overflow conditions 
in certain data bytes 


G C10 


0 


Status signal representing carry out condition 


CO S13 
C1 T14 
C2 R11 
C3 S12 
C4 P10 
C5 T13 


I 


Register fife write address select 


CFO E2 
CF1 D1 
CF2 F4 


I 


Configuration mode select, single 32-bit, two 
16-bit, or four 8-bit ALU's 


Cn F2 


I 


ALU carry input 


CLK F3 


I 


Clocks synchronous registers on positive edge 


DAO K3 
DA1 M1 
DA2 12 
DA3 N1 
DA4 M2 
DA 5 P1 
DA6 N2 
DA7 M3 
DA8 T2 
DA9 P4 


I/O 


A port data bus. Outputs register data (OEA = 0} 
or inputs external data (0EA = 1). 



Table 2. SIM74ACT8832 Pin Description (Continued) 



PIN 



NAME 


NO. 


DA10 


S3 


DA11 


R4 


DA12 


T3 


DA13 


S4 


DA14 


T4 


DA15 


P5 


DA16 


P14 


DA17 


T17 


DA18 


P15 


DA19 


N14 


DA20 


R16 


DA21 


S17 


DA22 


P16 


DA23 


N15 


DA24 


K15 


DA25 


K16 


DA26 


K17 


DA27 
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DA28 
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DA29 


J17 


DA30 


H17 


DA31 
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DBO 


G1 


DB1 


H2 


DB2 


H1 


DB3 


J1 


DB4 


J2 


DB5 


J3 


DB6 


K1 


DB7 


K2 


DB8 


P2 


DB9 


N3 


DB10 


S1 


DB11 


R2 


DB12 


P3 


DB13 


N4 


DB14 


T1 


DB15 


S2 



I/O 



DESCRIPTION 



I/O 



A port data bus. Outputs register data (OEA = 0) 
or inputs external data (OEA = 1). 



I/O 



B port data bus. Outputs register data (OEB = 0) 
or used to input external data (OEB = 1) 
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Table 2. SN74ACT8832 Pin Description (Continued) 



DIM 

NAME 


NO. 


I/O 


DESCRIPTION 


DB16 


T15 






DB17 


S14 






DB18 


R13 






DB19 


T16 






DB20 


S15 






DB21 


R14 






DB22 


P13 






DB23 
DB24 


S16 
R1 7 


I/O 


B port data bus. Outputs register data (OEB = 0) 
or used to input external data (OEB = 1 ) 
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CM 
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\3<C 


1 
1 


external DA bus and low state selects 
register file 
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G3 


1 
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FR1 

CD 1 
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icyioitJi iiic, cAit/iiidi uo pun aiiu iviv_i icyicsLci 


GND 


C8 






GND 


D6 






GND 


D7 






GND 


D8 






GND 


D10 






GND 


D1 1 






GND 


D12 






GND 


G4 






GND 
GND 


G14 
H4 




Ground pins. All ground pins must be used. 


GND 


H14 






GND 


K4 






GND 


K14 






GND 


L4 






GND 


L14 






GND 


M4 






GND 


P9 






GND 


P12 
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Table 2. SN74ACT8832 Pin Description (Continued) 



PIN 

NAME 


NO. 


I/O 


DESCRIPTION 


10 


D1 7 






11 


F15 






12 


E16 






13 
14 


E17 
F16 


I 


Instruction input 


15 


G15 






16 


F17 






17 


G16 






IESIO0 


A8 






IESI01 


A7 


I 


Shift pin enables, increases system speed and 


IESI02 


B7 




reduces bus conflict, active low 


IESI03 


B6 






MSERR 


B1 1 


0 


Master Slave Error pin, indicates error between 




data at Y output MUX and external Y port 


N 


A10 


0 


Output status signal representing sign condition 


OEA 


T5 


I 


DA bus enable, active low 


OEB 


R12 


I 


DB bus enable, active low 


OES 


A11 


I 


Status enable, active low 


OEYO 


C3 






OEY1 
OEY2 


C7 
C14 


I 


Y bus output enable, active low 


OEY3 


F14 






OVR 


B10 


0 


Output status signal represents overflow condition 


PAO 


R1 






PA1 
PA 2 


R5 
M14 


I/O 


Parity bits port for DA data 


PA3 


G17 






PBO 


L1 






PB1 
PB2 


R3 
R15 


I/O 


Parity bits port for DB data 


PB3 


L17 






PERRA 


S5 


0 


DA data parity error, signals error if an even parity 
check fails for any byte 


PERRB 


P11 


0 


DB data parity error, signals error if an even parity 
check fails for any byte 


PERRY 


C11 


0 


Y data parity error, signals error if an even parity 
check fails for any byte 



Table 2. SN74ACT8832 Pin Description (Continued) 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


PYO D4 
PY1 B5 
PY2 B15 
PY3 C16 


i/n 


Y port psrity data, input and output 


RFCLK S9 


I 


Register File Clock, allows multiple writes to be 
performed in one master clock cycle 


SELMQ E1 


I 


MQ register select, selects output of ALU shifter or 
MQ register to be placed on Y bus 


SELRFO T9 
SELRF1 T8 


I 


Register File select. Controls selection of the 
Register File(RF) inputs by the RF MUX 


SIOO A9 
SIOT B8 

5102 A6 

5103 A5 


I/O 


Bidirectional shift pin, active low 


SSF A12 


I 


Special Shift Function, implements conditional 
shift algorithms 


TPO E15 
TP1 D16 


I/O 


Test pins, supports system testing 


V C C C9 
V C C D9 
Vcc H3 
V C C HI 5 
V C C J* 
V CC J14 
Vcc L3 
V C C L15 
V C C 

v C c R9 




Supply voltrage (5 V) 


WEO S7 
WET T7 
WE2 R8 
WE3 S8 


I 


Register File WRITE ENABLE. Data is written into RF 
when write enables are low and a low to high 
Register File Clock (RFCLK) transition occurs. 
Active low. 



< 
z 

CO 
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Table 2. SN74ACT8832 Pin Description (Concluded) 



PIN 

NAME 


NO. 




nCCP'DIDTlOM 

UcoUnlr 1 IUIM 


YO 


E3 






Y1 


D2 






Y2 


C1 






Y3 


D3 






Y4 


E4 






Y5 


C2 






Y6 


B1 






Y7 


A1 






Y8 


D5 






Y9 


C4 






Y10 


B3 






Y11 


C5 






Y12 


B4 






Y13 


A2 






Y14 


C6 






Y15 


A3 


I/O 


Y port data bus 


Y16 


B12 






Y17 


C12 






Y18 


A13 






Y19 


B13 






Y20 


A14 






Y21 


B14 






Y22 


C13 






Y23 


A15 






Y24 


A16 






Y25 


A17 






Y26 


B16 






Y27 


D14 






Y28 


C15 






Y29 


B17 






Y30 


E14 






Y31 


D15 






Z 


B9 


0 


Output status signal represents zero condition 



7 ACT8832 Specification Tables 

absolute maximum ratings over operating free-air temperature range 
(unless otherwise noted) t 



Supply voltage, Vcc • • • • . -0.5 V to 6 V 

Input clamp current, l|K (V| < 0 or Vj > Vcc) ............. ±20 mA 

Output clamp current, Iqk 'Vo < 0 or Vo > Vcc) • • • ±50 mA 

Continuous output current, lo (Vo = 0 to Vcc) • • • • ±50 mA 

Continuous current through Vcc or GND pins ± 100 mA 

Operating free-air temperature range . 0°C to 70 °C 

Storage temperature range . -65°C to 150°C 



Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. 
These are stress ratings only and functional operation of the device at these or any other conditions beyond 
those indicated under "recommended operating conditions" is not implied. Exposure to absolute-maximum- 
rated conditions for extended periods may affect device reliability. 



Table 3. Recommended Operating Conditions 



PARAMETER 


MIN NOM MAX 


UNIT 


Vcc Supply voltage 


4.5 5.0 5,5 


V 


V|H High-level input voltage 




V 


V|L Low-level input voltage 


0 ,#'"0.8 


V 


'OH High-level output current 


— 8 

: ■ 


mA 


Iql Low-level output current 


8 


mA 


V| Input voltage 


% .i — 

0 vpp 


V 


Vq Output voltage 


o v C c 


V 


dt/dv Input transition rise or fall rate 


0 15 


ns/V 


Ta Operating free-air temperature 


0 70 


°C 
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Table 4. Electrical Characteristics 



PARAMETER 


TEST CONDITIONS 


Vcc 


T A - 25°C 


MIN MAX 


UNIT 


MIN TYP MAX 


VOH 


Iqh - 20 fiA 


4.5 V 


4.49 




V 


5.5 V 


5.49 




Iqh - "8 mA 


4.5 V 


■ is~ — 


3.76 


5.5 V 






v OL 


Iql - 20 /iA 


4.5 V 






V 


5.5 V 




C! 


Iql = 8 mA 


4.5 V 




£>' 0.45 


5.5 V 




°' 45 


'I 


V| = V CC or 0 


5.5 V 






ma 


•cc 


V| = V CC or 0, l 0 


5.5 V 








Cj 


V| = Vcc or 0 


5 V 






PF 


AIqC 1 


One input at 3.4 V, 
other inputs at 
0 or V cc 


5.5 V 




1 


mA 



Table 5. Register File Write Setup 



PARAMETER 


MIN MAX 


UNIT 




G5-C0 


4 






DA/B32-DA/B0, PA/B3-PA/BO 


7 






I7-I4 


13 „A 






UEY3-OEY0 


, ,,„ij|„iiiii — 






Y31-Y0 


4 JfP 






WE3-WE0 


4 /% " 


ns 




SELRF(DA,DB,PA,PB) 


%t r 






SELRF(Y) 


Ofs- 

sj# f 






SIO 


10 






SELMQ 


9 






IESIO3-IESIO0 


10 




t'h 


ALL 


0 





*This is the increase in supply current for each input that is at one of the specified TTL voltge levels rather 
then 0 V to Vcc- 
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Table 6. Maximum Switching Characteristics 



PARAMETER 


FROM (INPUT) 


TO (OUTPUT) 


UNIT 


Y 


C 


Z', 


SIO 


PERRA/B 


N 


OVR 


PA/B 
OA/B 


PY 


PERRY 


MSERR 




A5-AO.B5-BO 


36 


30 


37 


28 




30 


37 


16 


37 






ns 


DA3 1 -DA0.PA3-PA0 
DB31 -DB0.PB3-PB0 


36 


25 


37 


25 


20 


28 


37 




37 








30 


22 


31 


24 




28 


28 




32 






Ia 


37 


28 


37 


25 




31 


37 




37 






EB1-EBO 


37 


28 


37 


25 




31 


37 




37 






I7-IO 


37 


30 


37 


28 




32 




■ 


37 






CF2-CFO 


37 


30 


37 


28 




32, 






37 






OPR OFA 












... 




1 5 








HFY^-OFYO 


20 
















20 






SELMQ 


15 
















20 






SIO3-SIO0 


1 5 




25 






25 






27 






CLK 


21 
















28 






CLKMQ 


37 
















37 






RGLK 


37 


32 


37 


24 




32 


37 




37 






IESI03-IESIOO 


15 




25 






25 






27 






SSF 


25 




30 


22 




30 


22 




30 






Y 




















15 


15 



'ACT8832 Registered ALU 

The SN74ACT8832 is a 32-bit registered ALU that can be configured to operate as 
four 8-bit ALUs, two 16-bit ALUs, or a single 32-bit ALU. The processor instruction 
set is 100 percent upwardly compatible with the 'AS888 and includes 1 3 arithmetic 
and logical functions with 8 conditional shifts, multiplication, division, normalization, 
add and subtract immediate, bit and byte operations, and data conversions such as 
BCD, excess-3, and sign magnitude. New instructions permit internal flip-flops 
controlling BCD and divide operations to be loaded or read. 

Additional functions added to the 'ACT8832 include byte parity and master/slave 
operation. Parity is checked at the three data input ports and generated at the Y output 

mm port. The 64-word register file is 36 bits wide to permit storage of the parity bits. 

Eg] Master/slave comparator circuitry is provided at the Y port. 

The DA and DB ports can simultaneously input data to the ALU and the 64-word by 
^ 36-bit register file. Data and parity from the register file can be output on the DA and 

DB ports. Results of ALU and shift operations are output at the bidirectional Y port. 
^ The Y port can also be used in an input mode to furnish external data to the register 
^ file or during master/slave operation as an input to the master/slave comparator. 

Three 6-bit address ports allow a two-operand fetch and an operand write to be 
00 performed at the register file simultaneously. An MQ shifter and MQ register can also 
be configured to function independently to implement double-precision 8-bit, 1 6-bit, 
and 32-bit shift operations. An internal ALU bypass path increases the speeds of 
multiply, divide and normalize instructions. The path is also used by 'ACT8832 
instructions that permit bits and bytes to be manipulated. 

Architecture 

Figure 4 is a functional block diagram of the 'ACT8832. Control input signals are 
summarized in Table 7. Data flow and details of the functional elements are presented 
in the following paragraphs. 
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Table 7. ACT8832 Response to Control Inputs 



SIGNAL 


HIGH 


LOW 


CF2-CF0 


See Table 11 


See Table 1 1 


EA 


Selects external DA bus 


Selects register file 


EB1-EB0 


See Table 9 


See Table 9 


IESIO3-IESIO0 


Normal operation 


Force corresponding SIO 
inputs to high impedance 


I7-I0 


See Table 1 5 


See Table 1 5 


MQSEL 


Selects MQ register 


Selects ALU 


OEA 


Inhibits DA and PA output 


Enables DA and PA output 


OEB 


Inhibits DB and PB output 


Enables DB and PB output 


OEY3-OEY0 


Inhibits Y and PY outputs 


Enables Y and PY outputs 


RFSEL1-RFSEL0 


See Table 8 


See Table 8 


SSF 


Selects shifted ALU output 


Selects ALU (unshifted) output 


TP1-TP0 


See Table 14 


See Table 1 4 


WE3-WE0 


Inhibits register file write 


Byte enables for register file 
write (0 = LSB) 



Data Flow 

As shown in Figure 5, data enters the 'ACT8832 from three primary sources: the 
bidirectional Y port, which is used in an input mode to pass data to the register file; 
and the bidirectional DA and DB ports, used to input data to the register file or the 
R and S buses serving the ALU. Three associated I/O ports (PY, PA, and PB) are provided 
for associated parity data input and output. 

Data is input to the ALU through two multiplexers: R MUX, which selects the R bus 
operand from the DA port or the register file addressed by A5-A0; and S MUX, which 
selects data from the DB port, the register file addressed by B5-B0, or the multiplier- 
quotient (MQ) register. 

The result of the ALU operation is passed to the ALU shifter, where it is shifted or 
passed without shift to the Y bus for possible output from the 'ACT8832 and to the 
feedback MUX for possible storage in the internal register file. The MQ shifter, which 
operates in parallel with the ALU shifter, can be loaded from the ALU or the MQ register. 
The MQ shift result is passed to the MQ register, where it can be routed through the 
S MUX to the ALU or to the Y MUX for output from the chip. 

An internal bypass path allows data from the S MUX to be loaded directly into the 
ALU shifter or the divide/BCD flip-flops. Data from the divide/BCD flip-flops can be 
output via the MQ register. 
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SELRF1- ^ / 
SELRFO r 's 
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2 
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O 
H 
00 
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BYOF3- 
BYOF0 



PERRY PY3-PY0 OEY3-OEY0 



Figure 4. ACT8832 32-Bit Registered ALU 
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SELRF1- ^ / 
SELRFO ^ ?2 



RF 
DECODE 



PA3-PA0 «► 



EA 



DA31-DA0 4 > 



C n 



Si 

OL 



PARITY 
GENERATE 



7 



-"36 



REGISTER 
FILE 
64 X 36 



< 



32 I I I I 

R MUX ^ S MUX f - 



NERATE yL±_ g: 
1 32 uti- 

i UJ 



u 

Q «b 

in * 



\ Y MUX / 

3T 



32 



-« — WE3-WE0 
-4— C5-C0 



— B5-B0 



4— RFCLK 

-/^- 4^— PB3-PB0 



-4— EB1-EB0 



4» DB31-DB0 



PY3-PY0 OEY3-OEY0 Y31-Y0 

Figure 5. Data I/O 

Data can be output from the three bidirectional ports, Y, DA, and DB, and their 
associated parity ports, PY, PA, and PB. DA and DB can also be used to read ALU 
input data on the R and S buses for debug or other special purposes. 
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Architectural Elements 

Three-Port Register File 

The register file is 36 bits wide, permitting storage of a 32-bit data word with its 
associated parity bits. The 64 registers are accessed by three address ports. C5-C0 
address the destination register during write operations; A5-A0 and B5-B0 address 
any two registers during read operations. The address buses are also used to furnish 
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immediate data to the ALU: A3-A0 to provide constant data for the add and subtract 
immediate instructions; C3-C0 and A3-A0 to provide masks for set, reset, and test 
bit operations. 

Data is written into the register file when the write enable is low and a low-to-high 
register file clock (RFCLK) transition occurs. The separate register file clock allows 
multiple writes to be performed in one master clock cycle, allowing processors in multi- 
processor environments to update one another's internal register files during a single 
cycle. 

Four write enable inputs are provided to allow separate control of data inputs in a byte- 
oriented system. WE3 is the write enable for the most significant byte. 

Register file inputs are selected by the RF MUX under the control of two register file 
select signals, RFSEL1 and RFSELO, shown in Table 8 (see also Table 10). 

Table,, 8. RF MUX Select Inputs 



RFSEL1 


RFSELO 


SOURCE 


0 


0 


External DA input 


0 


1 


External DB input 


1 


0 


Y-output MUX 


1 


1 


External Y port 



R and S Multiplexers 

ALU inputs are selected by the R and S multiplexers. Controls which affect operand 
selection for instructions other than those using constants or masks are shown in 
Table 9. 



Table 9. ALU Source Operand Selects 



R-BUS 
OPERAND 
SELECT 
EA 


S-BUS 
OPERAND 
SELECT 
EB1-EB0 


RESULT 
DESTINATION 


^-SOURCE OPERAND 


0 




R bus 


'•-Register file addressed by A5-A0 


1 




R bus 


«-DA port 




0 0 


S bus 


^-Register file addressed by B5-B0 




1 0 


S bus 


*-DB port 




X 1 


S bus 


«-MQ register 
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Table 10. Destination Operand Select/Enables 



REGISTER 
FILE 
WRITE 
ENABLE 

Wt 


Y BUS 
OUTPUT 
ENABLE 
OEY 


Y MUS 
SELECT 
MQSEL 


REGISTER 
FILE 
SELECT 
RFSEL1-RFSEL0 


DA 
PORT 
OUTPUT 
ENABLE 

UtA 


DB 
PORT 
OUTPUT 
ENABLE 

ACQ 


RESULT 
DESTINATION 




SOURCE 


1 


0 


0 


X 


X 






Y/PY 




ALU shifter/parity generate 


1 


o 


1 


X 


X 






Y/PY 




MQ register/parity generate 


0 


0 


0 


1 


0 






Y/PY, RF 




ALU shifter/parity generate 


0 


0 


1 


1 


0 






Y/PY, RF 




MQ register/parity generate 


0 


1 


X 


1 


1 






RF 




External Y/PY 


0 


X 


X 


0 


0 


1 


X 


RF 




External DA/PA 


0 


X 


X 


0 


1 


X 


1 


RF 




External DB/PB 












0 




DA/PA 




R bus register file output 












1 




DA/PA 




Hi-Z 














0 


DB/PB 




S bus register file output 














1 


DB/PB 




Hi-Z 



SN74ACT8832 



Data Input and Output Ports 



The DA and DB ports can be used to load the S and/or R multiplexers from an external 
source or to read S or R bus outputs from the register file. The Y port can be used 
to load the register file and to output the next address selected by the Y output 
multiplexer. Tables 9 and 1 0 describe the MUX and output controls which affect DA, 
DB, and Y. 



ALU 



The ALU can perform seven arithmetic and six logical instructions on the two 32-bit 
operands selected by the R and S multiplexers. It also supports multiplication, division, 
normalization, bit and byte operations and data conversion, including excess-3 BCD 
arithmetic. The 'ACT8832 instruction set is summarized in Table 1 5. 

C/> The 'ACT8832 can be configured to operate as a single 32-bit ALU, two 1 6-bit ALUs, 
or four 8-bit ALUs (see Figures 6 and 7). It can also be configured to operate on a 

^ 32-bit word formed by adding leading zeros to the 1 2 least significant bits of R bus 
data. This is useful in certain IBM relative addressing schemes. 

O 



00 
00 



\r mux/ 



V mux / 



A\ 



SI02H 



ALU 
SHIFTER j 



7 \ Jq / 

A— 1 A SHIFTER /—I 



A, 



fie fi< 



MQ 
REGISTER 



MUX J - 



ALU 7 \ MQ 7 

SHIFTER/— J _X SHIFTER/ 1 







MQ 
REGISTER 



8YOF3 Y15-Y0 

Figure 6. 1 6- Bit Configuration 
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Configuration modes are controlled by three CF inputs as shown in Table 1 1 . These 
signals also select the data from which status signals other than byte overflow will 
be generated. 



Table 1 1 . Configuration Mode Selects 



CONTROL INPUTS 


MODE SELECTED 


DATA FROM WHICH STATUS OTHER 
THAN BYOF WILL BE GENERATED 


CF2 CF1 


CFO 


0 0 


0 


Four 8-bit 


Byte 0 


0 0 


1 


Four 8-bit 


Byte 1 


0 1 


0 


Four 8-bit 


Byte 2 


0 1 


1 


Four 8-bit 


Byte 3 


1 0 


0 


Two 16-bit 


Least significant 16-bit word 


10 


1 


Two 16-bit 


Most significant 16-bit word 


1 1 


0 


One 32-bit 


32-bit word 


1 1 


1 


Masked 32-bit 


32-bit word 



> 

o 

H ALU and MQ Shifters 
00 

2j The ALU and MQ shifters are used in all of the shift, multiply, divide and normalize 
N) functions. They can be used independently for single precision or concurrently for 

double precision shifts. Shifts can be made conditional, using the Special Shift Function 

(SSF) pin. 



Bidirectional Serial I/O Pins 

Four bidirectional SIO pins are provided to supply an end fill bit for certain shift 
instructions. These pins may also be used to read bits that are shifted out of the ALU 
or MQ shifters during certain instructions. Use of the SIO pins as inputs or outputs 
is summarized in Table 17. 

The four pins allow separate control of end fill inputs in configurations other than 32-bit 
mode (see Table 12 and Figure 4). 



Table 12. Data Determining SIO Input 



SIGNAL 


CORRESPONDING WORD, PARTIAL WORD OR BYTE 


32-BIT MODE 


16-BIT MODE 


8-BIT MODE 


SI03 






Byte 3 


si02 




most significant word 


Byte 2 


SIOT 






Byte 1 


SIOO 


32-bit word 


least significant word 


Byte 0 
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To increase system speed and reduce bus conflict, four SIO input enables 
(IESI03-IESIO0) are provided. A low on the se e nables will override internal pull-up 
resistor logic and force the corresponding SIO pins to the hig h im pedance state 
required before an input signal can appear on the signal line. If the SIO enables are not 
used, this condition is generated internally in the chip. Use of the enables allow internal 
decoding to be bypassed, resulting in faster speeds. 



The lESIOs are defaulted to a high because of internal pull-up resistors. When an 
SIO pin is used as an output, a low on its corresponding IESIO pin would force 
SIO to a high impedance state. The output would then be lost, but the internal 
operation of the chip would not be affected. 

MQ Register 

Data from the MQ shifter is written into the MQ register when a low-to-high transition 

occurs on clock CLK. The register has specific functions in double precision shifts, ^ 

multiplication, division and data conversion algorithms and can also be used as a 00 

temporary storage register. Data from the register file and the DA and DB buses can ^ 

be passed to the MQ register through the ALU. h- 

O 

The Y bus contains the output of the A LU s hifter if MQSEL is low and the output of < 
the MQ register if MQSEL is high. If OEY is low, ALU or MQ shifter output will 



be passed to the Y port; if OEY is high, the Y port becomes an input to the 
feedback MUX. (/) 



Conditional Shift Pin 

Conditional shifting algorithms may be implemented using the SSF pin under hardware 
or firmware control. If the SSF pin is high or floating, the shifted ALU output will be 
sent to the output buffers. If the SSF pin is pulled low externally, the ALU result will 
be passed directly to the output buffers, and MQ shifts will be inhibited. Conditional 
shifting is useful for scaling inputs in data arrays or in signal processing algorithms. 



Master/Slave Comparator 

A master/slave comparator is provided to compare d ata by tes from the Y output MUX 
with data bytes on the external Y port when OEY is high. If the data are 
not equal, a high signal is generated on the master slave error output pin (MSERR). 
A similar comparator is provided for the Y parity bits. 



Divide/BCD Flip-Flops 

Internal multiply/divide flip-flops are used by certain multiply and divide instructions 
to maintain status between instructions. Internal excess-3 BCD flip-flops preserve the 
carry from each nibble in excess-3 BCD operations. The BCD flip-flops are affected 
by all instructions except NOP and are cleared when a CLR instruction is executed. 
The flip-flops can be loaded and read externally using instructions LOADFF and DUMPFF 
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(see Table 15). This feature permits an iterative arithmetic operation such as 
multiplication or division to be interrupted immediately so that an external interrupt 
can be processed. 



Status 
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Eight status output signals are generated by the 'ACT8832. Four signals 
(BYOF3-BYOF0) indicate overflow conditions in certain data bytes (see Table 13). The 
others represent sign (N), zero (ZERO), carry-out (Cout) and overflow (OVR). N, ZERO, 
Cout, and OVR are generated from data selected by the mode configuration controls 
(CF2-CF0) as shown in Table 1 1 . 

I Carry-out is evaluated after each ALU operation. Sign and zero status are evaluated 
after ALU shift operation. Overflow (OVR) is determined by ORing the overflow result 
I from the ALU with the overflow result from the ALU shifter. 

Table 13. Data Determining BYOF Outputs 



SIGNAL 


CORRESPONDING WORD, PARTIAL WORD OR BYTE 


32-BIT MODE 


16-BIT MODE 


8-BIT MODE 


BYOF3 


32-bit word 


most significant word 


Byte 3 


BYOF2 






Byte 2 


BYOF1 




least significant word 


Byte 1 


BYOFO 






Byte 0 



Input Data Parity Check 

An even parity check is performed on each byte of input data at the DA, DB and Y 
ports. The check is performed by counting the number of ones in each byte and its 
corresponding parity bit. Parity bits are input on PA for DA data, PB for DB data and 
PYF or Y data. PAO, PBO and PYO are the parity bits for the least significant bytes 
of DA, DB and Y, respectively. If the result of the parity count is odd for any byte, 
a high appears at the parity error output pin (PERRA for DA data, PERRB for DB data, 
PERRY for Y data). 



Test Pins 

Two pins, TP1 -TP0, support system testing. These may be used, for example, to place 
all outputs in a high-impedance state, isolating the chip from the rest of the system 
(see Table 14). 
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Table 14. Test Pin Inputs 



TP1 


TPO 


RESULT 


0 


0 


All outputs and l/Os forced low 


0 


1 


All outputs and l/Os forced high 


1 


0 


All outputs and l/Os placed in a high impedance state 


1 


1 


Normal operation (default state) 



Instruction Set Overview 

Bits I7-I0 are used as instruction inputs to the 'ACT8832. Table 15 lists all 
instructions, divided into five groups, with their opcodes and mnemonics. 



Table 15. ACT8832 Instruction Set 



GROUP 1 INSTRUCTIONS 


INSTRUCTION BITS 






I3-I0 


MNEMONIC 


FUNCTION 


(HEX) 






0 




Used to access Group 4 instructions 


1 


ADD 


R + S + Cn 


2 


SUBR 


R + S + Cn 


3 


SUBS 


R + S + Cn 


4 


INCS 


S + Cn 


5 


INCNS 


S + Cn 


6 


INCR 


R + Cn 


7 


INCNR 


R + Cn 


8 




Used to access Group 3 instructions 


9 


XOR 


R XOR S 


A 


AND 


R AND S 


B 


OR 


R OR S 


C 


NAND 


R NAND S 


D 


NOR 


R NOR S 


E 


ANDNR 


RAND S 


F 




Used to access Group 5 instructions 



CM 
CO 
00 
00 

O 
< 

z 
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Table 15. 'ACT8832 Instruction Set (Continued) 



GROUP 2 INSTRUCTIONS 


INSTRUCTION RIT<» 






17-10 


MNEMONIC 

IllllklilVlllw 


FUNCTION 


(HEX) 






0 


SRA 


Arithmptic rinht <;innlp nrpri^inn ^hift 

r^l HI II 1 Iwllv 1 IMI 11 Oil IMIw Ul vwltflUII wl II 1 l 


1 


SRAD 


ArithmPtip rinht HoiiKIp nrppicinn <hift 

AAI III II 1 ICUls liyill UUUUIC piCOIOlUII Olllll 


2 


OnL. 


1 Ani^sl rinht cirtfilo rSrasMcinn ohift 

L.uyiv>cii iiyni oiiiym prcuioiuii oiiiri 


3 


SRLD 


L na\c*ft\ rinht Hnuhlp nrppi<sinn chift 

LU^IWul 1 1 1 1 1- UV/UUIw yjl CV/IOIUI 1 0 1 III I 


4 


SLA 


Arithmptif* Ipft cinnlp nrppicinn chift 


5 


SLAD 


Arithmptip Ipft Hmihlp nrppicinn chift 

A*M III II lie III* ICl l UUUUIC UlCvlOlUM olllll 


6 


SIX 


f*irpnlsir Ipft cinnlp nrppicinn chift 
v#iiv*uiai ici i diiiyic uigvIoiuii oiuii 


7 


SLCD 


Circular left double precision shift 

VII vUlMl Iwl I VJV/Vjk/Iw pi vwlwlvl 1 OlIIIL 


8 


SRC 


Circular right single precision shift 


9 


SRCD 


Circular right double precision shift 


A 


MQSRA 


Arithmetic right shift MQ register 


B 


MQSRL 


Logical right shift MQ register 


C 


MQSLL 


Logical left shift MQ register 


D 


MQSLC 


Circular left shift MQ register 


E 


LOADMQ 


Load MQ register 


F 


PASS 


Pass ALU to Y 



Table 15. ACT8832 Instruction Set (Continued) 



GROUP 3 INSTRUCTIONS 


INSTRUCTION BITS 






I7-I0 


MNEMONIC 


FUNCTION 


(HEX) 






08 


SET1 


Set bit 1 


18 


SETO 


Set bit 0 


28 


TB1 


Test bit (one) 


38 


TBO 


Test bit (zero) 


48 


ABS 


Absolute value 


58 


SMTC 


Sign magnitude/two's complement 


68 


ADD I 


Add immediate 


78 


SUBI 


Subtract immediate 


88 


BADD 


Byte add R to S 


98 


BSUBS 


Byte subtract S from R 


A8 


BSUBR 


Byte subtract R from S 


B8 


BINCS 


Byte increment S 


C8 


BINCNS 


Byte increment negative S 


D8 


BXOR 


Byte XOR R and S 


E8 


BAND 


Byte AND R and S 


F8 


BOR 


Byte OR R and S 
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Table 15. 'ACT8832 Instruction Set (Continued) 



CO 
2 



GROUP 4 INSTRUCTIONS 


INSTRUCTION BITS 






17-10 


MNEMONIC 


FUNCTION 

ryiiv • ivn 


(HEX) 






00 


CRC 


f \/plip rprii mrtziripx/ pharsptpr appiimulptinn 
w y who i cuui luui i**/ wi icii ov» id gv»v*ui i luiaiivi i 


10 




Qplprt Q nr R 


90 




oniric idiyui mui 1 1 idiittj 


^o 




UUUUIC IClly III 1 Ivm 1 1 10II/.6 


4.0 

"TV/ 


DIVRF 


Di\/iHp rpmainHpr fiy 

L/IVIUC IGlllCHllUd IIA 


50 


SDIVQF 


Sianpd Hix/irip nuntipnt fiy 


60 


SMULI 


Sifinpri mnltinlv itpratp 


70 


SMULT 


Signed multiply terminate 


80 


SDIVIN 


Signed divide initialize 


90 


SDIVIS 


Signed divide start 


AO 


SDIVI 


Signed divide iterate 


B0 


UDIVIS 


Unsigned divide start 


CO 


UDIVI 


Unsigned divide iterate 


DO 


UMULI 


Unsigned multiply iterate 


EO 


SDIVIT 


Signed divide terminate 


FO 


UDIVIT 


Unsigned divide terminate 
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Table 15. 'ACT8832 instruction Set (Continued) 



GROUP 5 INSTRUCTIONS 


INSTRUCTION BITS 






I7-I0 


MNEMONIC 


FUNCTION 


(HEX) 






OF 


LOADFF 


Load divide/BCD flip-flops 


1F 


CLR 


Clear 


2F 


CLR 


Clear 


3F 


CLR 


Clear 


4F 


CLR 


Clear 


5F 


DUMPFF 


Output divide/BCD flip-flops 


6F 


CLR 


Clear 


7F 


BCDBIN 


BCD to binary 


8F 


EX3BC 


Excess-3 byte correction 


9F 


EX3C 


Excess-3 word correction 


AF 


SDIVO 


Signed divide overflow test 


BF 


CLR 


Clear 


CF 


CLR 


Clear 


DF 


BINEX3 


Binary to excess-3 


EF 


CLR 


Clear 


FF 


NOP 


No operation 



Group 1 , a set of ALU arithmetic and logic operations, can be combined with the user- 
selected shift operations in Group 2 in one instruction cycle. The other groups contain 
instructions for bit and byte operations, division and multiplication, data conversion, 
and other functions such as sorting, normalization and polynomial code accumulation. 



Arithmetic/Logic Instructions with Shifts 

The seven Group 1 arithmetic instructions operate on data from the R and/or S 
multiplexers and the carry-in. Carry-out is evaluated after ALU operation; other status 
pins are evaluated after the accompanying shift operation, when applicable. Group 1 
logic instructions do not use carry-in; carry-out is forced to zero. 

Possible shift instructions are listed in Group 2. Fourteen single and double precision 
shifts can be specified, or the ALU result can be passed unshifted to the MQ register 
or to the specified output destination by using the LOADMQ or PASS instructions. 
Table 1 6 lists shift definitions. 

When using the shift registers for double precision operations, the least significant 
half should be placed in the MQ register and the most significant half in the ALU for 
passage to the ALU shifter. An example of a double-precision shift using the ALU and 
MQ shifters is given in Figure 8. 
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SERIAL DATA 
INPUT SIGNALS 



SIOO 



1 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 | 



~n 

\,31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 o/ 

Single Precision Logical Right Single Shift, 32-Bit Configuration 



SERIAL DATA 
INPUT SIGNALS 



SIO0- 



z 

> 

o 

H 
00 
00 
Ca> 
N3 



\31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0/ 

l : : ~ ' mo — ' ■ n 

|31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 | 



\31 30 



29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0, 



7 



Double Precision Logical Right Single Shift, 32-Bit Configuration 

Figure 8. Shift Examples, 32-Bit Configuration 

All Group 2 shifts can be made conditional using the conditional shift pin (SSF). If the 
SSF pin is high or floating, the shifted ALU output will be sent to the output buffers, 
MQ register, or both. If the SSF pin is pulled low, the ALU result will be passed directly 
to the output buffers and any MQ shifts will be inhibited. 

Table 16. Shift Definitions 



SHIFT TYPE 



Left 
Right 

Arithmetic right 

Arithmetic left 

Circular right 
Circular left 
Logical right 

Logical left 



NOTES 



Moves a bit one position towards the most significant bit 
Moves a bit one position towards the least significant bit 
Retains the sign unless an overflow occurs, in which case, the 
sign would be inverted 

May lose the sign bit if an overflow occurs. Zero is filled into 
the least significant bit unless the bit is set externally 
Fills the least significant bit in the most significant bit position 
Fills the most significant bit in the least significant bit position 
Fills a zero in the most significant bit position unless the bit 
is forced to one by placing a zero on an SIO pin 
Fills a zero in the least significant bit position unless the bit 
is forced to one by placing a zero on an SIO pin 
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The bidirectional SIO pins can be used to supply external end fill bits for certain Group 2 
shift instructions. When SIO is high or floating, a zero is filled, otherwise a 1 is filled 
Table 17 lists instructions that make use of the SIO inputs and identifies input and 
output functions. 



Table 17. Bidirectional SIO Pin Functions 





SIO 


RITQ 17-in 
DUO 1 /-IV/ 


MNEMONIC 


I/O 


DATA 




SRA 


0 


Shift out 


1 * 

1 


SRAD 


0 


Shift out 


0 * 


SRL 


1 


Most significant bit 


o* 
o 


SRLD 


1 


Most significant bit 


A* 

*T 


SLA 


1 


Least significant bit 


K * 


SLAD 


1 


Least significant bit 


* o 


SLC 


0 


Shifted input to MQ shifter 


7* 


SLCD 


0 


Shifted input to MQ shifter 


ft* 
o 


SRC 


0 


Shifted input to ALU shifter 




SRCD 


0 


Shifted input to ALU shifter 


A* 


MQSRA 


0 


Shift out 


B* 


MQSRL 


1 


Most significant bit 


c* 


MQSLL 


1 


Least significant bit 


u 


MQSLC 


0 


Shifted input to MQ shifter 


nn 


CRC 


0 


Internally generated end fill bit 


20 


CMADIlll 

oNUnlvl 


1 


Least significant bit 


30 


DNORM 


1 


Least significant bit 


60 


SMULI 


0 


ALUO 


70 


SMULT 


0 


ALUO 


80 


SDIVIN 


0 


Internally generated end fill bit 


90 


SDIVIS 


0 


Internally generated end fill bit 


AO 


SDIVI 


0 


Internally generated end fill bit 


B0 


UDIVIS 


0 


Internally generated end fill bit 


CO 


UDIVI 


0 


Internally generated end fill bit 


DO 


UMULI 


0 


Internal input 


E0 


SDIVT 


0 


Internally generated end fill bit 


F0 


UDIVIT 


0 


Internally generated end fill bit 


7F 


BCDBIN 


1 


Least significant bit 


DF 


BINEX3 


0 


Shifted input to MQ register 
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Other Arithmetic Instructions 



CO 

> 

O 
H 
00 
00 
CO 



The 'ACT8832 supports two immediate arithmetic operations. ADD! and SUB! 
(Group 3) add or subtract a constant between the values of 0 and 1 5 from an operand 
on the S bus. The constant value is specified in bits A3-A0. 

Twelve Group 4 instructions support serial division and multiplication. Signed, unsigned 
and mixed multiplication are implemented using three instructions: SMULI, which 
performs a signed times unsigned iteration; SMULT, which provides negative weighting 
of the sign bit of a negative multiplier in signed multiplication; and UMULI, which 
performs an unsigned multiplication iteration. Algorithms using these instructions are 
given in Tables 18, 19, and 20. These include: signed multiplication, which performs 
a two's complement multiplication; unsigned multiplication, which produces an 
unsigned times unsigned product; and mixed multiplication which multiplies a signed 
multiplicand by an unsigned multiplier to produce a signed result. 

Table 18. Signed Multiplication Algorithm 



OP 




CLOCK 


INPUT 


INPUT 


OUTPUT 


CODE 


MNEMONIC 


CYCLES 


S PORT 


R PORT 


Y PORT 


E4 


LOADMQ 


1 


Multiplier 




Multiplier 


60 


SMULI 


N-1 f 


Accumulator 


Multiplicand 


Partial product 


70 


SMULT 


1 


Accumulator 


Multiplicand 


Product (MSH)* 


Table 19. Unsigned Multiplication Algorithm 


OP 




CLOCK 


INPUT 


INPUT 


OUTPUT 


CODE 


MNEMONIC 


CYCLES 


S PORT 


R PORT 


Y PORT 


E4 


LOADMQ 


1 


Multiplier 




Multiplier 


DO 


UMULI 


N-r 1 " 


Accumulator 


Multiplicand 


Partial product 


DO 


UMULI 


1 


Accumulator 


Multiplicand 


Product (MSH)* 


Table 20. Mixed Multiplication Algorithm 


OP 




CLOCK 


INPUT 


INPUT 


OUTPUT 


CODE 


MNEMONIC 


CYCLES 


S PORT 


R PORT 


Y PORT 


E4 


LOADMQ 


1 


Multiplier 




Multiplier 


60 


SMULI 


N-l* 


Accumulator 


Multiplicand 


Partial product 


60 


SMULI 


1 


Accumulator 


Multiplicand 


Product (MSH)* 



f N - 8 for quad 8-bit mode, 16 for dual 16-bit mode, 32 for 32-bit mode. 
*The least significant half of the product is in the MQ register. 



3-46 



Instructions that support division include start, iterate and terminate instructions for 
unsigned division routines (UDIVIS, UDIVI and UDIVIT); initialize, start, iterate and 
terminate instructions for signed division routines (SDIVIN, SDIVIS, SDIVI and SDIVIT); 
and correction instructions for these routines (DIVRF and SDIVQF). A Group 5 
instruction, SDIVO, is available for optional overflow testing. Algorithms for signed 
and unsigned division are given in Tables 21 and 22. These use a nonrestoring 
technique to divide a 1 6 N-bit integer dividend by an 8 N-bit integer divisor to produce 
an 8 N-bit integer quotient and remainder, where N = 1 for quad 8-bit mode, N = 2 
for dual 16-bit mode, and N = 4 for 32-bit mode. 



Table 21. Signed Division Algorithm 



OP 
CODE 


MNEMONIC 


CLOCK 
CYCLES 


INPUT 
S PORT 


INPUT 
R PORT 


OUTPUT 
Y PORT 


E4 


LOADMQ 


1 


Dividend (LSH) 




Dividend (LSH) 


80 


SDIVIN 


1 


Dividend (MSH) 


Divisor 


Remainder (N) 


AF 


SDIVO 


1 


Remainder (N) 


Divisor 


Overflow Test 












Result 


90 


SDIVIS 


1 


Remainder (N) 


Divisor 


Remainder (N) 


AO 


SDIVI 


N-2 f 


Remainder (N) 


Divisor 


Remainder (N) 


EO 


SDIVIT 


1 


Remainder (N) 


Divisor 


Remainder^ 


40 


DIVRF 


1 


Remainder* 


Divisor 


Remainder^ 


50 


SDIVQF 


1 


MQ register 


Divisor 


Quotient # 



*N = 8 for quad 8-bit mode, 16 for dual 16-bit mode, 32 for 32-bit mode. 
*The least significant half of the product is in the MQ register. 
§ Unfixed 

1 Fixed (corrected) 

#The quotient is stored in the MQ register. Remainder can be output at the Y port or stored in 
the register file accumulator. 



Table 22. Unsigned Division Algorithm 



OP 
CODE 


MNEMONIC 


CLOCK 
CYCLES 


INPUT 
S PORT 


INPUT 
R PORT 


OUTPUT 
Y PORT 


E4 


LOADMQ 


1 


Dividend (LSH) 




Dividend (LSH) 


BO 


UDIVIS 


1 


Dividend (MSH) 


Divisor 


Remainder (N) 


CO 


UDIVI 


N-1 1 " 


Remainder (N) 


Divisor 


Remainder (N) 


FO 


UDIVIT 


1 


Remainder (N) 


Divisor 


Remainder* 


40 


DIVRF 


1 


Remainder § 


Divisor 


Remainder^ 



N = 8 in quad 8-bit mode, 16 in dual 16-bit mode, 32 in 32-bit mode 
Unfixed 

Fixed (corrected) 
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Data Conversion Instructions 



CS) 
2 
•^1 

> 

O 
H 
00 
00 
CO 



Conversion of binary data to one's and two's complement can be implemented using 
the INCNR instruction (Group 1). SMTC (Group 3) permits conversion from two's 
complement representation to sign magnitude representation, or vice versa. Two's 
complement numbers can be converted to their positive value, using ABS (Group 3). 

SNORM and DNORM (Group 4) provide for normalization of signed, single- and double- 
precision data. The operand is placed in the MQ register and shifted toward the most 
significant bit until the two most significant bits are of opposite value. Zeroes are shifted 
into the least significant bit, provided SIO is high or floating. (A low on SIO will shift 
a one into the least significant bit.) SNORM allows the number of shifts to be counted 
and stored in one of the register files to provide the exponent. 

Data stored in binary-coded decimal form can be converted to binary using BCDBIN 
(Group 5). A routine for this conversion, given in Table 23, allows the user to convert 
an N-digit BCD number to a 4N-bit binary number in 4N + 8 clock cycles. 

Table 23. BCD to Binary Algorithm 



OP 
CODE 



MNEMONIC 



CLOCK 
CYCLES 



INPUT 
S PORT 



INPUT 
R PORT 



OUTPUT 
DESTINATION 



E4 
D2 
D2 
D1 
68 

REPEAT 

DA 
D1 
7F 
7F 



LOADMQ 
SUBR/MQSLC 
SUBR/MQSLC 
MQSLC 
ADDI (15) 

N-1 TIMES 1 * 

AND/MQSLC 
ADD/MQSLC 
BCDBIN 
BCDBIN 



END REPEAT 



FA 
D1 



AND 

ADD MQSLC 



BCD operand 
Accumulator 
Mask reg. 
Don't care 
Accumulator 



MQ reg. 
Accumulator 
Interim reg. 
Accumulator 



MQ reg. 
Accumulator 



Accumulator 
Mask reg. 
Don't care 
Decimal 15 



Mask reg. 
Interim reg. 
Interim res. 
Interim reg. 



Mask reg. 
Interim reg. 



MQ reg. 

Accumulator/MQ reg. 
Mask reg/MQ reg. 
MQ reg. 
Mask reg. 



Interim reg/MQ reg. 
Interim reg/MQ reg. 
Accumulator/MQ reg. 
Accumulator/MQ reg. 



Interim reg. 
Accumulator 



Number of BCD digits 



BINEX3, EX3BC, and EX3C assist binary to excess-3 conversion. Using BINEX3, an 
N-bit binary number can be converted to an N/4- digit excess-3 number. For an 
algorithm, see Table 24. 
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Table 24. BCD to Binary Algorithm 



OP 
CODE 


MNEMONIC 


CLOCK 
CYCLES 


INPUT 
S PORT 


INPUT 
R PORT 


OUTPUT 
DESTINATION 


E4 
D2 
D2 

REPEAT 

DF 
9F 

END RE 


LOADMQ 
SUBR 

SET1 (33)16 

N TIMES t 

BINEX3 
EX3C 

>EAT 


1 
1 
1 

1 
1 


Binary number 

Accumulator 

Accumulator 

Accumulator 
Accumulator 


Accumulator 
Mask (33)16 

Accumulator 
Internal data 


MQ reg. 

Accumulator 

Accumulator 

Accumulator/MQ reg 
Accumulator 



*N = Number of bits in binary number 

CM 

Bit and Byte Instructions £2 

00 

Four Group 3 instructions allow the user to test or set selected bits within a byte. £2 
SET1 and SETO force selected bits of a selected byte (or bytes) to one and zero, O 
respectively. TB1 and TBO test selected bits of a selected byte (or bytes) for ones ^ 
and zeros. The bits to be set or tested are specified by an 8-bit mask formed by the 
concatentation of register file address inputs C3-G0 and A3-A0. The register file Z 
addressed by B5-B0 is used as the destination operand for the set bit instructions. ^ 
Register writes are inhibited for test bit instructions. Bytes to be operated on are 
selected by forcing SlOn low, where n represents the byte position and 0 represents 
the least significant byte. A high on the zero output pin signifies that the test data 
matches the mask; a low on the zero output indicates that the test has failed. 

Individual bytes of data can also be manipulated using eight Group 3 byte 
arithmetic/logic instructions. Bytes can be added, subtracted, incremented, ORed, 
ANDed and exclusive ORed. Like the bit instructions, bytes are selected by forcing 
SlOn low, but multiple bytes can be operated on only if they are adjacent to one another; 
at least one byte must be nonselected. 

Other Instructions 

SEL (Group 4) selects one of the ALU's two operands, S or R, depending on the state 
of the SSF pin. This instruction could be used in sort routines to select the larger or 
smaller of two operands by performing a subtraction and sending the status result 
to SSF. CRC (Group 4) is designed to verify serial binary data that has been transmitted 
over a channel using a cyclic redundancy check code. An algorithm using this instruction 
is given in Table 25. 
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Table 25. CRC Algorithm 



OP 
CODE 


MNEMONIC 


CLOCK 
CYCLES 


INPUT 
SPORT 


INPUT 
R PORT 


OUTPUT 
DESTINATION 


E4 
F6 
F2 

REPEAT 

00 
E4 

END RE 


LOADMQ 

INCR 

SUBR 

n/8N TIMES 1 " 
CRC 

LOADMQ 
>EAT 


1 
1 
1 

1 
1 


Vector c'(x)" 1 " 
Accumulator 

Accumulator 
Vector c'(x)t 


Polynomial g(x) 
Accumulator 

Poly reg. 


MQ reg. 
Poly reg. 
Accumulator 

Accumulator 
MQ reg. 



Mi t N = Number of bits in binary number 
n = Length of the code vector 

(/> 

Z 

CLR forces the ALU output to zero and clears the internal BCD flip-flops used in excess-3 
^ BCD operations. NOP forces the ALU output to zero, but does not affect the flip-flops. 
O 

H Configuration Options 

2 The 'ACT8832 can be configured to operate in 8-bit, 1 6-bit, or 32-bit modes, depending 
|sj on the setting of the configuration mode selects (CF2-CF0). Table 1 1 shows the control 
inputs for the four operating modes. Selecting an operating configuration other than 
32-bit mode affects ALU operation and status generation in several ways, depending 
on the mode selected. 

Masked 32-Bit Operation 

Masked 32-bit operation is selected to reset to zero the 20 most significant bits of 
the R Mux input. The 1 2 least significant bits are unaffected by the mask. Only Group 
1 and Group 2 instructions can be used in this operating configuration. Status 
generation is similar to unmasked 32-bit operating mode. 

Shift Instructions 

Shift instructions operate similarly in 8-bit, 16-bit, and 32-bit modes. The serial I/O 
(SIO3'-SIO0 f ) pins are used to select end-fill bits or to s hift b its in or out, depending 
on the operation being performed. Table 12 shows the SIO signals associated with 
each byte or word in the different modes, and Table 1 7 indicates the specific function 
performed by the SIO pins during shift, multiply, and divide operations. 

Figures 9 and 10 present examples of logical right shifts in 16-bit and 8-bit 
configurations. 
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SERIAL DATA 
INPUT SIGNALS 



SIOO- 



SI02 



\31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16/ \l5 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0/ 

Single Precision Logical Right Single Shift, 16-Bit Configuration 



SERIAL DATA 
INPUT SIGNALS 



SIOO 



SI02 



1 30 29 29 27 26 25 24 23 22 21 20 19 18 17 16 1 



\31 30 29 28 27 26 25 24 23 22 21 20 



19 18 17 16/ 



1 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 I 



\31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16/ 



TTT1 



H5 14 13 12 11 10 9 8 7 6 5 4 3 

\15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0/ 

[ ~~- ~mo -"■ I 

HS 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

\15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0/ 



CM 
00 
00 
00 

O 
< 

z 

CO 



Double Precision Logical Right Single Shift, 16-Bit Configuration 

Figure 9. Shift Examples, 16-Bit Configuration 
Bit and Byte Instructions 

The 'ACT8832 performs bit operations similarly in 8-bit, 16-bit, and 32-bit modes. 
Masks are loaded into the R MUX on the A3-A0 and C3-C0 address inputs, and the 
bytes to be masked are selected by pulling their SIO' inputs low. Instructions which 
set, reset, or test bits are explained later 

Byte operations should be performed in 32-bit mode to get the necessary status 
outputs. While byte overflow signals are provided for all four bytes (BYOF3-BYOF0), 
the other status signals (C, N, Z) are output only for the word selected with the 
configuration control signals (CF2-CF0). 



Status Selection 

Status results (C, N, Z, and overflow) are internally generated for all words in all modes, 
but only the overflow results (BYOF3-BYOF0) are available for all four bytes in 8-bit 
mode or for both words in 1 6-bit mode. If a specific application requires that the four 
status results are read for two or four words, it is possible to toggle the configuration 
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SERIAL DATA 
INPUT SIGNALS 



SIO0-+- 

SI01 

SI02-*- 



|31 30 29 26 27 26 25 24 I 



S5E 



m?7 



\}1 30 29 28 27 26 25 24/ \23 22 21 20 19 18 17 16/ \lS 14 13 12 11 10 9 */ \j 6 5 4 3 2 1 o/ 

Single-Precision Logical Right Shift, 8-Bit Configuration 



SERIAL DATA 
INPUT SIGNALS 

SIO0 -» 

SIQ1 -> — 

SI02-* 



I ALU I 

131 30 29 28 27 26 25 24 1 



C/> SI03- 

2 

> 

O 
H 
00 
00 
(a) 



131 30 29 28 27 26 25 24 I 
\31 30 29 28 27 26 25 24/ 



I" ALU I 

1 23 22 21 20 19 18 17 16 1 



\31 30 29 28 27 26 25 24/ \23 22 21 20 19 18 17 1 



\23 22 21 20 19 18 17 16,/ 

I m5 1 

23 22 21 20 19 18 17 16 I 
\23 22 21 20 19 18 17 16/ 



I alu : 1 

15 14 13 12 11 10 9 6 J 
\l5 14 13 12 11 10 9 8/ 

I SET 1 

IS 14 13 12 11 10 9 8 
\15 14 13 12 11 10 9 8/ 



6 5 4 3 2 



3 ? 1 9 I 
3 2 1 0/ 



7 6 5 4 3 2 1 0 
\7 6 5 4 3 2 1 a/ 



Double-Precision Logical Right Shift, 8-Bit Configuration 

Figure 10. Shift Examples, 8-Bit Configuration 

control signals (CF2-CF0) within the same clock cycle and read the additional status 
results. This assumes that the necessary external hardware is provided to toggle 
CF2-CF0 and collect the status for the individual words before the next clock signal 
is input. 



Instruction Set 

The 'ACT8832 instruction set is presented in alphabetical order on the following pages. 
The discussion of each instruction includes a functional description, list of possible 
operands, data flow diagram, and notes on status and control bits affected by the 
instruction. Microcoded examples are also shown. 

Mnemonics and opcodes for instructions are given at the top of each page. Opcodes 
for instructions in Groups 1 and 2 are four bits long and are combined into eight-bit 
instructions which select combinations of arithmetic, logical/and shift operations. 
Opcodes for the other instruction groups are all eight bits long. 

An asterisk in the left side of the opcode box for a Group 1 instruction indicates that 
a Group 2 opcode is needed to complete the instruction. An asterisk in the right side 
of a box indicates that a Group 1 opcode is required to combine with the Group 2 
opcode in the left side of the box. 
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ABS 



Absolute Value 



FUNCTION 

Computes the absolute value of two's complement data on the S bus. 
DESCRIPTION 

Two's complement data on the S bus is converted to its absolute value. The carry 
must be set to one by the user for proper conversion. ABS causes S' + Cn to be 
computed; the state of the sign bit determines whether S or S' + Cn will be selected 
as the result. SSF is used to transmit the sign of S. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CN 
CO 
00 
00 
h- 
O 
< 

z 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIOO 


No 


Inactive 


SI01" 


No 


Inactive 


SI02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


Yes 


Should be programmed high for proper conversion. 
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nrm 



Absolute Value 



ABS 



Status Signals 



ZERO = 1 if result = 0 

N = 1 if MSB (input) = 1 
OVR = 1 if input of most significant byte is 80 (Hex) and inputs (if any) in all 
other bytes are 00 (Hex). 
C = 1 if S = 0 



EXAMPLES (assumes a 32-bit configuration) 

Convert the two's complement number in register 1 to its positive value and store 
the result in register 4. 



Instr 


Oprd 


Oprd 


Oprd Sel 


□est 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


0100 1000 


XX XXXX 


00 0001 


X 00 


00 0100 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



CO 
2 

> 

O 
H 
00 
00 
GO 

ro 



Example 1: Assume register file 1 holds F6D81340 (Hex): 



Source 1 1 1 1 01 10 1 101 1000 0001 001 1 0100 oooo S RF(1 ) 



Destination 



0000 1001 0010 01 1 1 1 1 10 1 100 1 100 0000 



RF(4) «- S + Cn 



Example 2: Assume register file 1 holds 09D527C0 (Hex): 



Source 00001001 1101 0101 00100111 11000000 S RF(1) 



Destination 0000 1001 1101 0101 00100111 11000000 RF(4) *- S 
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ADD 



Add with Carry (R + S + Cn) 



TJ 



FUNCTION 

Adds data on the R and S buses to the carry-in. 
DESCRIPTION 

Data on the R and S buses is added with carry. The sum appears at the ALU and MQ 
shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 1 5. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3- AO 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
CO 
00 
00 

o 
< 

z 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Yes 


Yes 
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Add with Carry (R + S + Cn) 



ADD 



Control/Data Signals 



Signal 



User 
Programmable 



Use 



SSF 



SIO0 
STOT 
SI02 
SI03 
Cn 



No 

No 
No 
No 
No 
Yes 



Affect shift instructions programmed in bits 17-14 of 

instruction field. 

Inactive 

Inactive 

Inactive 

Inactive 

Increments sum if set to one. 



Z 

> 

O 
H 
00 
00 
GO 

ro 



Status Signals ' 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLES (assumes a 32-bit configuration) 

Add data in register 1 to data on the DB bus with carry-in and pass the result to the 
MQ register. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-IO 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1110 0001 


00 0001 


XX XXXX 


0 10 


XX XXXX 


0 


1111 


10 X X 


XXXX 


0 


0 


110 



Assume register file 1 holds 0802C618 (Hex and DB bus holds 1E007530 (Hex): 

R *- RF(1) 



Source oooo 1000 oooo 0010110001100001 1000 



Source 0001 1110 oooo oooo 01 11 0101 001 1 oooo S *- DB bus 



Destination 0010 0110 oooo 0011 0011 1011 0100 1000 MQ register *- R +S + Cn 
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ADDI 



ADD Immediate 



rrm 



FUNCTION 

Adds four-bit immediate data on A3-A0 with carry to S-bus data. 
DESCRIPTION 

Immediate data in the range 0 to 15, supplied by the user at A3-A0, is added with 
carry to S. 



Available R Bus Source Operands (Constant) 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


Mask 


No 


Yes 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
CO 
00 
00 

O 
< 

z 

CO 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIOO 


No 


Inactive 


STOT 


No 


Inactive 


SI02 


No 


Inactive 


Si03 


No 


Inactive 


Cn 


Yes 


Increments sum if set to one. 
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rmn 



ADD Immediate 



ADDI 



Status Signals 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



EXAMPLES (assumes a 32-bit configuration) 

Add the valule 1 2 to data on the DB bus with carry-in and store the result in register 
file 1 . 





Instr 


Oprd 


Oprd 


Oprd Sel 


Oest 




Destination Selects 










CO 


Code 


Addr 


Addr 


EB1- 


Addr 


WE3- 


SELRF1- 


OEY3 






CF2- 


z 


I7-I0 


A5-AQ 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 




0110 1000 


00 1100 


XX XXXX 


X 10 


00 0001 


0 0000 


10 X X 


XXXX 


0 


0 


110 



> Assume bits A5-A0 hold OC (Hex) and DB bus holds 24000100 (Hex): 
O 

H | : — • , 

00 Source oooo oooo oooo oooo oooo oooo oooo 1 100 R *- A 5- AO 

00 
CO 



Source 0010 0100 oooo oooo oooo 0001 oooo oooo S DB bus 



Destination 00100100000000000000,000100001100 RF(1) *- R +S + Cn 
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AND 



Logical AMD (R AND S) 



FUNCTION 

Evaluates the logical expression R AND S. 
DESCRIPTION 

Data on the R bus is ANDed with data on the S bus. The result appears at the ALU 
and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-BO) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
00 
00 
00 
h» 
O 
< 

z 



Available Destination Operands Shift Operations 



RF 
(C5-C0J 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Yes 


Yes 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 
instruction field. 


sioo 


No 


Inactive 


SIOT 


No 


Inactive 


Si02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


No 


Inactive 
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logical AMD (R AND S) 



AND 



Status Signals t 



ZERO 
N 

OVR 
C 



1 if result = 0 
1 if MSB = 1 
0 
0 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



c/> 
z 

vj 

> 

O 



EXAMPLES (assumes a 32-bit configuration) 

Logically AND the contents of register 3 and register 5 and store the result 
in register 5. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Oest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 1010 


00 001 1 


00 0101 


0 00 


00 0101 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



g Assume register file 3 holds F61 7D840 (Hex) and register file 5 holds 1 5F6D842 (Hex): 

W . 

Source 111101100001011111011000 01000000 R *- RF(3) 



Source 0001 0101 1111 01101101 100001000010 



RF(5) 



Destination 0001 01000001 0110 1101 1000 01000000 RF(5) *- R AND S 
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ANDNR 



Logic AMD Negative R (R f AND S) 



FUNCTION 

Computes the logical expression S AND NOT R. 
DESCRIPTION 

The logical expression S AND NOT R is computed. The result appears at the ALU and 
MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 1 5. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
CO 
00 
00 

O 
< 

z 

CO 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Yes 


Yes 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 
instruction field. 


sioo 


No 


Inactive 


sioT 


No 


Inactive 


sio5 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


No 


Inactive 
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Logic AND Negative R (R' AMD S) 



ANDNR 



Status Signals* 



ZERO 


= 1 


N 


= 0 


OVR 


= 0 


C 


= 0 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



EXAMPLE (assumes a 32-bit configuration) 

Invert the contents of register 3, logically AND the result with data in register 5 
and store the result in register 10. 

C/> 
Z 
vj 

> 
O 
H 
00 

00 Assume register file 3 holds 1 5F6D840 (Hex) and register file 5 hold F61 7D842 (Hex): 
(a) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Oest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 1110 


00 0011 


00 0101 


0 00 


00 1010 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



Source 0001 0101 1111 01101101 100001000000 R «- RF(3) 



Source 1 1 1 1 01 10 0001 01 1 1 1 101 1000 0100 0010 S RlF(5) 



Destination 1 1 10 0010 0000 0001 0000 0000 0000 0010 RF( 10) *- R AND S 
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BADD 



Byte Add R to S with Carry 



8 



FUNCTION 

Adds S with carry-in to a selected byte or selected adjacent bytes of R. 



DESCRIPTION 

SIO3-SIO0 are used to sele ct bytes of R to be added to the corresponding bytes of 
S. A byte of R with SIO programmed low is selected for the computation of 
R '+ S + Cn. If the SIO signal for a byte of R is left high, the corresponding byte 
of S is passed unaltered. Multiple bytes can be selected only if they are adjacent to 
one another. At least one byte must be nonselected. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 

(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sioo 


Yes 


Byte select 


sioT 


Yes 


Byte select 


SI02 


Yes 


Byte select 


SI03 


Yes 


Byte select 


Cn 


Yes 


Propagates through nonselected bytes; increments 
selected byte(s) if programmed high. 
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8 8 



Byte Add R to S with Carry 



BADD 



Status Signals 



ZERO = 1 if result (selected bytes) = 0 

N = 0 

OVR = 1 if signed arithmetic overflow (selected bytes) 

C = 1 if carry-out (most significant selected byte) =1 



EXAMPLE (assumes a 32-bit configuration) 

Add bytes 1 and 2 of register 3 with carry to the contents of register 1 and store the 
result in register 11. 



Instr 
Code 
I7-I0 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sel 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 






SI03- 

sioo 


IESI03- 


IESIO0 


0100 1000 


00 001 1 


00 0001 


0 00 


00 1011 


0 0000 10 X X XXXX 0 


1 


110 


1001 


0000 



(/> 
2 

> Assume register file 3 holds 2C01 8181 (Hex) and register file 1 holds 7A8FBE3E (Hex): 

O '- 

qq Source 0010 1100 0000 0001 1000 0001 1000 0001 Rn *- RF(3)n 
00 

^ Source 01111010100011111011111000111110 Sn*-RF(1)n 



Destination 



ALU 10100110 1001 0001 01000000 11000000 Fn *- Rn +Sn + Cn 

RF(11)n Fn or Sn* 



0111 1010 1001 0001 0100 1111 0011 ii-io- 



t F = ALU result 
n = nth byte 

Register file 1 1 gets F if byte selected, S if byte not selected. 
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BAND 



Byte AND R AND S (Byte Logical AND R AND S) LLH 



FUNCTION 

Evaluates the logical AND of selected bytes of R-bus and S-bus data. 
DESCRIPTION 

Byte s wit h their corresponding SIO signals programmed low compute R AND S. Bytes 
with SIO signals programmed high, pass S unaltered. Multiple bytes can be selected 
only if they are adjacent to one another. At least one byte must be nonselected. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-AO) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
00 
00 
00 
h- 
O 
< 

z 

CO 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Forced low 




SIOO 


Yes 


Byte select 




SIOT 


Yes 


Byte select 




Sl02 


Yes 


Byte select 




SI03 


Yes 


Byte select 




Cn 


No 


Inactive 
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EJLU Byte AMD R AND S (Byte logical AMD B AND S) BAWD 



Status Signals 



ZERO = 1 if result (selected bytes) = 0 

N = 0 

OVR =0 

C = 0 



EXAMPLE (assumes a 32-bit configuration) 

Logically AND bytes 1 and 2 of register 3 with input on the DB bus; store the result 
in register 3. 



Instr 


Oprd 


Oprd 


Oprd Set 


Dest 


Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 


VVE3- SELRF1- 0EY3- 




CF2- 


SI03- 


IESI03- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CFO 


sloo 


lESiOO 


1110 1000 


00 001 1 


XX XXXX 


0 10 


00 001 1 


0 0000 10 X X XXXX 0 


X 


110 


1001 


0000 



z 

J* Assume register file 3 holds 398FBEBE (Hex) and input on the DB port is 4290BFBF 
3 (Hex): 

00 
00 



CO 



Source 



0011 1001 1000 1111 1011 1110 1011 1110 



Rn - RF(3)n 



Source 01000010100100001011111110111111 Sn *- DBn 

RF(3)n - Fn or Sn f 



Destination 



0100001010000000 1011 11101011 1111 



f F = ALU result 
n = nth byte 

Register file 3 gets F if byte selected, S if byte not selected. 
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BCDBIN 



BCD to Binary 



cm 



FUNCTION 

Converts a BCD number to binary. 
DESCRIPTION 

This instruction allows the user to convert an N-digit BCD number to a 4N-bit binary 
number in 4(N-1) plus 8 clocks. The instruction sums the R and S buses with carry. 

A one-bit arithmetic left shift is perf ormed on the ALU output. A zero is filled into bit 0 
of the least significant byte unless SIO0 is set low, which would force bit 0 to one. 
Bit 7 of the most significant byte is dropped. 

Simultaneously, the contents of the MQ register are rotated one bit to the left. Bit 
7 of the most significant byte is rotated to bit 0 of the least significant byte. 

Recommended R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


No 


No 


Recommended S Bus Source Operands 


RF 

(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


No 




Recommended Destination Operands 


RF 

(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


No 





CM 
CO 
00 
00 
h- 

o 
< 

Z 
(/) 



ALU 


MQ 


Left 


Left 
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BCD to Binary 



BCDBIN 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 




Yes 


If high or floating, fills a zero in LSB of ALU shifter; 
if low, fills a one in LSB of ALU shifter. 


sioT 


No 


Inactive in 32-bit configuration. Used in other 


sm 


No 


configurations to select endf ill in LSBs. 


Si03 


No 




Cn 


Yes 


Should be programmed low for proper conversion. 



Z 

> 

O 
H 
00 
00 
CO 

ro 



Status Signals 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



ALGORITHM 

The following code converts an N-digit BCD number to a 4N-bit binary number in 4(N-1 ) 
plus 8 clocks. It employs the standard conversion formula for a BCD number (shown 
here for 32 bits): 

ABCD = [(A x 10 + B) x 10 + C] x 10 + D. 



The conversion begins with the most significant BCD digit. Addition is performed in 
radix 2. 



LOADMQ NUM 

SUB ACC, ACC, SLCMQ 

SUB MSK, MSK, SLCMQ 

SLCMQ 
SLCMQ 

ADDI ACC, MSK, 1 5 



Load MQ with BCD number. 

Clear accumulator; 
Circular left shift MQ. 

Clear mask register; 
Circular left shift MQ. 

Circular left shift MQ. 

Circular left shift MQ. 

Store 15 in mask register. 



3-68 



BCDBIN 



BCD to Binary 



7 F 



Repeat N-1 times: 

(N = number of BCD digits) 
AND MQ, MSK, R1, SLCMQ 

ADD, ACQ, R1,.R1, SLCMQ 

BCDBIN R1, R1, ACC 



BCDBIN ACC, R1, ACC 



(END REPEAT) 

AND MQ MSK, R1 
ADD ACC, R1, ACC 



Extract one digit; 
Circular left shift MQ. 

Add extracted digit to accumulator, and 
store result in R1; Circular left shift MQ. 

Perform BCDBIN instruction, and store 

result in accumulator 

[4 x (ACC + 4 x digit)]; 

Circular left shift MQ. 

Perform BCDBIN instruction, and store 
result in accumulator 
[10 x (ACC + 10 x digit)]; 
Circular left shift MQ. 



Fetch last digit. 

Add in last digit and store result in 
accumulator. 



CM 
CO 
00 
00 

h- 
o 
< 

2 

V) 
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Byte Increment Negative S with Carry 



BINCNS 



FUNCTION 

S' + Cn for selected bytes of S. 
DESCRIPTION 



Bytes with SIO0 programmed low compute S' + Cn. Bytes with SIOO programmed 
high pass S unaltered. Multiple bytes can be selected only if they are adjacent to one 
another. At least one byte must be nonselected. 



Available R Bus Source Operands 



C/> 
Z 

> 
O 
H 
00 
00 
GO 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 

(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 



User 
Programmable 



Use 



SSF 

SIOO 

SIOT 

SI02 

SI03 

Cn 



No 

Yes 

Yes 

Yes 

Yes 

Yes 



Inactive 
Byte select 
Byte select 
Byte select 
Byte select 

Propagates through nonselected bytes; increments 
selected byte(s) if programmed high. 
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BINCNS Byte Increment Negative S with Carry 1 C 18 



Status Signals 



ZERO = 1 if result (selected bytes) =0 

N = 0 

OVR = 1 if signed arithmetic overflow (selected bytes) 

C = 1 if carry-out (most significant selected byte) = 1 



EXAMPLE (assumes a 32-bit configuration) 

Invert bytes 0 and 1 of register 3 and add them to the carry (bytes 2 and 3 are not 
changed). Store the result in register 3. 



Instr 
Code 
I7-I0 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Set 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 






SI03- 
SIOO 


IESI03- 


IESIO0 


1100 1000 


XX XXXX 


00 0001 


X 00 


00 001 1 


0 0000 10 X X XXXX 0 


1 


110 


1100 


0000 



Assume register file 3 holds A3018181 (Hex): 
Source 



1010 001 1 0000 0001 1000 0001 1000 0001 



Sn - RF(3)n 



Destination 



ALU 0101 1100 1111 1110 0111 1110 0111 1111 Fn *- S'n + Cn 

RF(3)n - Fn or Sn* 



1010 0011 0000 0001 0111 1110 0111 1111 



T F = ALU result 
n = nth byte 

Register file 3 gets F if byte selected, S if byte not selected. 
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B 



8 



Byte Increment S with Carry 



BINCS 



FUNCTION 

Increments selected bytes of S if the carry is set. 
DESCRIPTION 

Bytes with SIO' inputs programmed low compute S + Cn. Bytes with SIO inputs 
programmed high, pass S unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonsselected. 



Available R Bus Source Operands 



2 

■si 

> 

O 
H 
00 
00 
OJ 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIO0 


Yes 


Byte select 


SI01 


Yes 


Byte select 


SI02 


Yes 


Byte select 


Sl03 


Yes 


Byte select 


Cn 


Yes 


Propagates through nonselected bytes; increments 
selected byte(s) if programmed high. 
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BINCS 



Byte Increment S with Carry 



B 8 



Status Signals 



ZERO = 1 if result (selected bytes) =0 

N = 0 

OVR = 1 if signed arithmetic overflow (selected bytes) 

C = 1 if carry-out (most significant selected byte) = 1 



EXAMPLE (assumes a 32-bit configuration) 

Add bytes 1 and 2 of register 7 to the carry (bytes 0 and 3 are not changed). Store 
the result in register 2. 



Instr 
Code 
I7-I0 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sel 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 


SI03- 


IESI03- 


SIOO 


IESIO0 


1011 1000 


XX XXXX 


00 01 1 1 


X 00 


00 0010 


0 0000 10 X X XXXX 0 


1 


110 


1100 


0000 




Assume register file 7 holds 408FBEBE (Hex): 
Source 



0100 0000 1000 1111 1011 1110 1011 1110 



Sn «- RF(7)n 



Destination 



ALU 01000000100011111011111110111110 Fn *- Sn + Cn 

RF(2)n - Fn or Sn f 



00 
I- 

o 
< 

CO 



0100 0000 1000 1111 1011 1111 1011 1110 



f F = ALU result 
n = nth byte 

Register file 1 1 gets F if byte selected, S if byte not selected. 
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D F 



Binary to Excess-3 



BINEX3 




CO 

z 

> 

o 

H 

00 
00 
CO 

ho 



FUNCTION 

Converts a binary number to excess-3 representation. 
DESCRIPTION 

This instruction converts an N-digit binary number to a N/4 digit excess-3 number 
representation in 2N + 3 clocks. The data on the R and S buses are added to the carry- 
in, which contains the most significant bit of the MQ register. The contents of the 
MQ register are rotated one bit to the left. The most significant bit is shifted out and 
passed to the least significant bit position. Depending on the configuration selected, 
this shift may be within the same byte or from the most significant byte to the least 
significant byte. 

Recommended R Bus Source Operands 









C3-CO 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


No 


No 


Recommended S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


No 




Recommended Destination Operands 


RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 





ALU 


MQ 


None 


Left 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sToo 


No 


Inactive 


SlOT 


No 


Inactive 


SI02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


No 


Holds MSB of MQ register. 
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BINEX3 Binary to Excess -3 1 D 1 F 



Status Signals 



ZERO = 1 if result = 0 

N =1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



ALGORITHM 

The following code converts an N-digit binary number to a N/4 digit excess-3 number 
in 2N + 3 clocks. It employs the standard conversion formula for a binary number: 

a n 2" + a n -i 2"-1 + an-2 2 n " 2 + . . . + a 0 - = 

CM 

([(2a n + a n -l) x 2 + a n -il x 2 <+... + a 0 } x 2 + a°. CO 

00 
00 

The conversion begins with the most significant bit. Addition during the BINEX3 H" 
instruction is performed in radix 10 (excess-3). ^ 

<fr 

LOADMQ NUM Load MQ with binary number. 

Z 

SUB ACG, ACC, ACC Clear accumulator; C/> 

SET1 ACC, 33 (Hex) Store 33 (Hex) in all bytes of 

accumulator. 

Repeat N times: 

(N = number of bits in binary number) 

BINEX3 ACC, ACC, ACC Double accumulator and add in most 

significant bit of MQ register. Circular left 
shift MQ. 

EX3C ACC Perform excess-3 correction. 

(END REPEAT) 



3-75 



F 8 



Byte OR R and S 
(Byte Inclusive OR R and S) 



BOR 



FUNCTION 

Evaluates R OR S of selected bytes. 
DESCRIPTION 

Bytes with SIO inputs programmed low evaluate R OR S. Bytes with SIO inputs 
programmed high, pass S unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonselected. 



Available R Bus Source Operands 



Z 
vJ 

> 

O 
H 
00 
00 
CO 
N) 









C3-C0 


RF 
(A5-AO) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-BO) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 




SIO0 


Yes 


Byte select 




SioT 


Yes 


Byte select 




5102 


Yes 


Byte select 




5RJ5 


Yes 


Byte select 




Cn 


No 


Inactive 
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BOR 



Byte OR R and S 
(Byte Inclusive OR R and S) 



F 8 



Status Signals 



ZERO = 1 if result (selected bytes) = 0 

N = 0 

OVR =0 

C = 0 



EXAMPLE (assumes a 32-bit configuration) 

Logically OR bytes 1 and 2 of register 1 2 with bytes 1 and 2 on the DB bus. Concatenate 
with DB bytes 0 and 3/ storing the result in register 12. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




Destination Selects 














Code 


Addr 


Addr 


EB1- 


Addr 




WE3- SELRF1- 


0EY3- 






CF2- 


sioT- 


IESI03- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


sioo 


IESIO0 


1111 1000 


00 1100 


XX XXXX 


0 10 


00 1100 


0 


0000 10 X X 


XXXX 


0 


X 


110 


1001 


0000 



CM 

n 

CO 

oo 



Assume register file 1 2 holds 578FBEBE (Hex) and the DB bus holds 1 C90BEBE (Hex): £j 

Rn - RF(12)n 



Source 



0101 0111 1000 1111 1011 1110 1011 1110 



Source 0001 1100 1001 oooo 1011 1110 1011 1100 



Sn *- DBn 



< 

Z 
CO 



Destination 



0001 1100 1001 1111 1011 1110 1011 1110 



RF(12)n +- Fn or Sn 1 



f F = ALU result 
n = nth package 

Register file 12 gets F if byte selected, S if byte not selected. 
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Byte Subtract R from S with Carry 



BSUBR 



FUNCTION 

Subtracts R from S in selected bytes. 
DESCRIPTION 

Bytes with SIO inputs programmed low compute R' + S + Cn. Bytes with SIO inputs 
programmed high, pass S unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonselected. 



Available R Bus Source Operands 



> 

O 
H 
00 
00 
Ca) 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-BO) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sioo 


Yes 


Byte select 


5ioT 


Yes 


Byte select 




Yes 


Byte select 


SI03 


Yes 


Byte select 


Cn 


Yes 


Propagates through nonselected bytes; should be 
set high for two's complement subtraction. 
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BSUBR 



Byte Subtract R from S with Carry 



A 8 



Status Signals 



ZERO = 1 if result (selected bytes) = 0 

N = 0 

OVR = 1 if signed arithmetic overflow (selected bytes) 

C = 1 if carry-out (most significant selected byte) = 1 



EXAMPLE (assumes a 32-bit configuration) 

Subtract bytes 1 and 2 of register 1 with carry from bytes 1 and 2 of register 3. 
Concatenate with bytes 0 and 3 of register 3, storing the result in register 11. 



Instr 
Code 
I7-I0 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sei 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- 0EY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 


SI03- 


IESI03- 


SIOO 


IESIO0 


1010 1000 


00 0001 


00 001 1 


0 00 


00 1011 


0 0000 10 X X XXXX 0 


1 


110 


1001 


0000 



CM 
CO 
00 
00 

O 
< 

z 

CO 



Assume register file 1 holds 091 B5858 (Hex) and register file 3 holds 703A9898 (Hex): 



Source 



0000 1001 0001 1011 0101 1000 0101 1000 



Rn - RF(1)n 



Source 0111 00000011 1010 1001 1000 1001 1000 Sn RF(3)n 



Destination 



ALU 01100111000111110100000001000000 Fn *- R'n +Sn + Cn 

RF(1 1)n «- Fn or Snt 



0111 00000001 1111 01000000 1001 1000 



f F = ALU result 
n = nth package 

Register file 1 1 gets F if byte selected, S if byte not selected. 
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9 



8 



Byte Subtract S from R with Carry 



BSUBS 



FUNCTION 

Subtracts S from R in selected bytes. 
DESCRIPTION 

Bytes with SIO inputs programmed low compute R + S' + Cn. Bytes with SIO inputs 
programmed high, pass S unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonselected. 



Available R Bus Source Operands 



z 

> 

O 
H 
00 
00 
CO 

ro 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 

(G5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SlOO 


Yes 


Byte select 


§IST 


Yes 


Byte select 


3102 


Yes 


Byte select 


S1S3 


Yes 


Byte select 


Cn 


Yes 


Propagates through nonselected bytes; should be 
set high for two's complement subtraction. 
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BSUBS 



Byte Subtract S from R with Carry 



9 8 



Status Signals 



ZERO = 1 if result (selected bytes) = 0 

N = 0 

OVR = 1 if signed arithmetic overflow (selected bytes) 

C = 1 if carry-out (most significant selected byte) = 1 



EXAMPLE (assumes a 32-bit configuration) 

Subtract bytes 1 and 2 of register 3 with carry from bytes 1 and 2 of register 1 . 
Concatenate with bytes 0 and 3 of register 3, storing the result in register 1 1 . 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




Destination Selects 














Code 


Addr 


Addr 


EB1- 


Addr 




WE3- SELRF1- 


OEY3- 






CF2- 


sio3- 


IESI03- 


I7-IO 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


sibo 


IESIO0 


1001 1000 


00 0001 


00 001 1 


0 00 


00 1011 


0 


0000 10 X X 


XXXX 


0 


1 


110 


1001 


0000 



CO 

Assume register file 1 holds 5288B8B8 (Hex) and register file 3 holds 143A9898 (Hex): 00 

00 



Source 
Source 



0101 0010 1000 1000 1011 1000 1011 1000 



0001 0100 0011 1010 1001 1000 1001 1000 



Rn «- RF(1)n 
Sn - RF(3)n 



< 

Z 

C/) 



Destination 



ALU 0011 1110 0100 1110 0010 0000 0010 0000 Fn Rn +S'n + Cn 

RF(1 1)n - Fn or Sn* 



0101 0010 0100 1110 0010 0000 1011 1000 



f F = ALU result 
n = nth byte 

Register file 1 1 gets F if byte selected, S if byte not selected. 
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D 



8 



Byte XOR R and S 
(Byte Exclusive OR R and S) 



BXOR 



FUNCTION 

Evaluates R exclusive OR S in selected bytes. 



DESCRIPTION 

Bytes with SIO inputs programmed low evaluate R exclusive OR S. Bytes with SIO 
inputs programmed high, pass S unaltered. Multiple bytes can be selected only if they 
are adjacent to one another. At least one byte must be nonselected. 



Available R Bus Source Operands 



Z 

> 

O 
H 
00 
00 
CO 









C3-C0 


RF 
(A5-AO) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 




SSF 


No 


Inactive 


Sioo 


Yes 


Byte select 


SioT 


Yes 


Byte select 


Sf02 


Yes 


Byte select 


§103 


Yes 


Byte select 


Gn 


No 


Inactive 



Use 
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Byte XOR R and S 
BXOR (Byte Exclusive OR R and S) 1 D I 8 



Status Signals 



ZERO = 1 if result (selected bytes) = 0 

N = 0 

OVR =0 

C =0 



EXAMPLE (assumes a 32-bit configuration) 

Exclusive OR bytes 1 and 2 of register 6 with bytes 1 and 2 on the DB bus; concatenate 
the result with DB bytes 0 and 3, storing the result in register 10. 



Instr 


Oprd 


Oprd 


Oprd Set 


Dest 


Destination Selects 














Code 


Addr 


Addr 


EB1- 


Addr 


WE3- SELRF1- 


0EY3- 






CF2- 


SI03- 


IESI03- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


sioo 


IESIO0 


1101 1000 


00 0110 


XX XXXX 


0 10 


00 1010 


0 0000 10 X X 


XXXX 


0 


1 


110 


1001 


0000 



CM 
CO 
00 
00 

O 
< 

z 

CO 



Assume register file 6 holds 938FBEBE (Hex) and the DB bus holds 41 90BEBE (Hex): 



Source 1001 0011 1000 11111011111010111110 Rn *- RF(6)n 



Source 01000001 1001 0000 1011 1110 1011 1110 Sn DBn 



Destination 



0100 0001 0001 1111 0000 0000 1011 11 10 



RF(10)n ««- Fn or Sn f 



T F = ALU result 
n = nth package 

Register file 10 gets F if byte selected, S if byte not selected. 
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1 F 



CLEAR 



CLR 



FUNCTION 

Forces ALU output to zero and clears the BCD flip-flops. 
DESCRIPTION 

ALU output is forced to zero and the BCD flip-flops are cleared. 

*This instruction may also be coded with the following opcodes: 
[2] [F], [3] [F], [4] [F], [6] [F], IB] [F], [C] [F], [E] [F] 



Available R Bus Source Operands 




Z 

> 

o 

H 

00 
00 
CO 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




No 


No 


No 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Status Signals 



ZERO = 1 

N = 0 

OVR = 0 

Cn = 0 
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CRC 



Cyclic Redundancy Character Accumulation 



0 0 



FUNCTION 

Evaluates R exclusive OR S for use with cyclic redundancy check codes. 
DESCRIPTION 

Data on the R bus is exclusive ORed with data on the S bus. If MQO XNORed with 
SO is zero (MQO is the LSB of the MQ register and SO is the LSB of S-bus data), the 
result is sent to the ALU shifter. Otherwise, data on the S bus is sent to the ALU shifter. 

A right shift is performed; the MSB is filled with RO (MQO XOR SO), where RO is the 
LSB of R-bus data. A circular right shift is performed on MQ data. 



Recommended R Bus Source Operands 









C3-C0 


(A5-AO) 


A3- AO 
Immed 


DA-Port 


A3- AO 
Mask 


Yes 


No 


No 


No 


Recommended S Bus Source Operands 


RF 
(B5-BO) 


DB-Port 


MQ 
Register 




Yes 


Yes 


No 




Recommended Destination Operands 


RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


No 





CO 
00 
CO 
h- 

o 
< 

z 



ALU 


MQ 


Right 


Right 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIOO 


No 


Inactive 


SI01 


No 


Inactive 


SICil 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


No 


Inactive 
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I 0 I 0 1 Cyclic Redundancy Character Accumulation CRC 



Status Signals 



ZERO 


= 1 


N 


= 0 


OVR 


= 0 


Cn 


= 0 



if result = 0 



CYCLIC REDUNDANCY CHARACTER CHECK 
DESCRIPTION 

3 Serial binary data transmitted over a channel is susceptible to error bursts. These bursts 
may be detected and corrected by standard encoding methods such as cyclic 
redundancy check codes, fire codes, or computer generated codes. These codes all 
J£ divide the message vector by a generator polynomial to produce a remainder that 
contains parity information about the message vector. 

^ If a message vector of m bits, a(x), is divided by a generator polynomial, g(x), of order 

O k-1 , a k bit remainder, r(x), is formed. The code vector, c(x), consisting of m(x) and 

^ r(x) of length n = m + k is transmitted down the channel. The receiver divides the 

00 received vector by g(x). 
GO 

After m divide iterations, r(x) will be regenerated only if there is no error in the message 
bits. After k more iterations, the result will be zero if and only if no error has occurred 
in either the message or the remainder. 



ALGORITHM 



An algorithm for a cyclic redundancy character check, using the 'ACT8832 as a 
receiver, is given below: 

Load MQ with first 32 message bits of 
received vector c'(x). 



LOADMQ VEG(X) 



LOAD POLY 
CLEAR SUM 
REPEAT (n/32) TIMES: 

SUM = SUM CRC POLY 



LOADMQ VEC(X) 
(END REPEAT) 



Load register with polynomial g(x). 
Clear register acting as accumulator. 



Perform CRC instruction where 

R Bus = POLY 

S Bus = SUM 
Store result in SUM. 

Load MQ with next 32 message bits of 
received vector c'(x). 
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CRC Cyclic Redundancy Character Accumulation I 0 I 0 



SUM now contains the remainder [r'(x)] of c'(x). A syndrome generation routine may 
be called next, if required. 

Note that the most significant bit of 

g(x) = (o k .i)(xk-1).+ (g k . 2 )(x k -2) + . .(g 0 )(xQ) 

is implied and that POLY(O) is set to zero if the length of g(x) requires fewer bits than 
are in the machine word width. 



3 



CM 
CO 
00 
00 

o 
< 

2 
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4 



0 



Divide Remainder Fix 



DIVRF 



z 

> 

o 

H 
00 
00 
GO 

ro 



FUNCTION 

Corrects the remainder of nonrestoring division routine if correction is required. 
DESCRIPTION 

DIVRF tests the result of the final step in nonrestoring division iteration: SDIVIT (for 
signed division) or UDIVIT (for unsigned division). An error in the remainder results 
when it is nonzero and the signs of the remainder and the dividend are different. 

The R bus must be loaded with the divisor and the S bus with the most significant 
half of the previous result. The least significant half is in the MQ register. The Y bus 
result must be stored in the register file for use during the subsequent SDIVQF 
instruction. 

DIVRF tests to determine whether a fix is required and evaluates: 

Y S + R' + 1 if a fix is necessary 

Y S + R + 0 if a fix is unnecessary 

Overflow is reported to OVR at the end of the division routine (after SDIVQF). 
Recommended R Bus Source Operands 









C3-C0 


RF 


A3-A0 


DA-Port 








(A5-A0) 


Immed 


A3-A0 








Mask 


Yes 


No 


No 


No 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


No 



ALU 


MQ 


None 


None 
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DIVRF Divide Remainder Fix 14 10 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sISo 


No 


Inactive 


sioT 


No 


Inactive 


siol 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


Yes 


Should be programmed high 


Status Signals 


ZERO = 


1 if remainder = 


0 


N = 


0 




OVR = 


0 




Cn = 


1 if carry-out = 1 
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Double-length Normalize 



DNORM 



FUNCTION 

Tests the two most significant bits of a double precision number. If they are the same, 
shifts the number to the left. 



(A 
2 

> 

O 
H 
00 
00 
Ca> 

ho 



DESCRIPTION 

This instruction is used to normalize a two's complement, double precision nu mber 
by shifting the number one bit to the left and filling a zero into the LSB unless SIO0 
is low. The S bus holds the most significant half; the MQ register holds the least 
significant half. 

Normalization is complete when overflow occurs. The shift is inhibited whenever 
normalization is attempted on a number already normalized. 

Available R Bus Source Operands 









C3-C0 


RF 
(A5-AO) 


A3-AO 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Recommended S Bus Source 
Operands (MSH) 




RF 
(B5-BO) 


DB-Port 


MQ 
Register 




Yes 


No 


No 




Recommended Destination 
Operands 




RF 
(C5-CO) 


RF 
(B5-BO) 


Y-Port 




Yes 


No 


No 





Shift Operations 
(conditional) 



ALU 


MQ 


Left 


Left 
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DNORM 



Double-Length Normalize 



3 0 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


§100 


Yes 


When low, selects a one end-fill bit in LSB 


slor 


No 


Passes internally generated end-fill bits 


SI02 


No 




5103 


No 




Cn 


No 





Status Signals 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if MSB XOR 2nd MSB 

Cn =0 



EXAMPLE (assumes a 32-bit configuration) 
Normalize a double-precision number. 

(This example assumes that the MSH of the number to be normalized is in register 3 
and the LSH is in the MQ register. The zero on the OVR pin at the end of the instruction 
cycle indicates that normalization is not complete and the instruction should be 
repeated). 



M 
CO 
00 
00 
h- 
U 
< 

2 
CO 



Instr 


Oprd 


Oprd 


Oprd Set 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EB0 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


0011 0000 


XX XXXX 


00 001 1 


X 00 


00 001 1 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



Assume register file 3 holds FA75D84E (Hex) and MQ register holds 37F6D843 (Hex): 



0011 0111 1111 0110 1101 1000 0100 0011 



Source 1111 10100111 0101 1101 10000100 1110 ALU shifter *- RF(3) 
Source 
Destination 
Destination 



1111 0100 1110 1011 1011 0000 1001 1101 



0110 1111 11101101 1011000010000110 



MQ shifter <«- MQ register 
8RR3) *- Result (MSH) 
MQ register *- Result (LSH) 



OVR - O 1 " 



^Normalization not complete at the end of this instruction cycle. 
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Output Divide/BCD Flip Flops 



DUMPFF 



FUNCTION 

Output contents of the divide/BCD flip-flops. 
DESCRIPTION 

The contents of the divide/BCD flip-flops are passed through the MQ register to the 
Y output Imultiplexer. 



Available R Bus Source Operands 



2 

> 

O 
H 
00 
00 
CO 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




No 


No 


No 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


No 


No 


Yes 



ALU 


MQ 


None 


None 



Status Signals 
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OUMPFF 



Output Divide/BCD Flip Flops 



5 F 



EXAMPLES (assumes a 32-bit configuration) 
Dump divide/BCD flip-flops to Y output. 



Instr 
Code 
17-10 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sel 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 


0101 1111 


XX XXXX 


XX XXXX 


X XX 


XX XXXX 


1 XXXX XX X X 0000 X 


X 


110 



Assume divide/BCD flip-flops contain 2A055470 (Hex): 



Source ooio ioio oooo oioi oioi 0100 oi 1 1 oooo MQ register *- Divide/BCD flip-flops 



Destination 0010 1010 oooo 0101 0101 0100 01 1 1 oooo Y output *- MQ register 



CM 
CO 
00 
00 

o 
< 

z 
c/) 
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JJJU Excess-3 Byte Correction EX3BC 



z 

o 

H 
00 
00 
CO 



FUNCTION 

Corrects the result of excess-3 addition or subtraction in selected bytes. 
DESCRIPTION 

This instruction corrects excess-3 additions or subtractions in the byte mode. For 
correct excess-3 arithmetic, this instruction must follow each add or subtract. The 
operand must be on the S bus. 

Data on the S bus is added to a constant on the R bus determined by the state of 
the BCD flip flops and previous overflow condition reported on the SSF pin. Bytes with 
SIO inputs programmed low evaluate the correct excess-3 representation. Bytes with 
SIO inputs programmed high or floating, pass S unaltered. 

Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 








DA-Port 




(A5-A0) 


Immed 


A3-A0 








Mask 


No 


No 


No 


No 



Available S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


No 


No 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


No 



ALU 


MQ 


No 


No 



Control/Data Signals 



Signal 


User 
Programmable 




SSF 


No 


Inactive 


sioo 


Yes 


Byte select 


SI01 


Yes 


Byte select 


SI02 


Yes 


Byte select 


SI03 


Yes 


Byte select 


Cn 


No 


Inactive 



Use 
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EX3BC 



Excess-3 Byte Correction 



8 F 



Status Signals 



ZERO = 0 

N = 0 

OVR = 1 if arithmetic signed overflow 

Cn = 1 if carry-out = 1 



EXAMPLE (assumes a 32-bit configuration) 

Add two BCD numbers and store the sum in register 3. Assume data comes in on 
DB bus. 



1 . Clear accumulator (SUB ACC, ACQ 

2. Store 33 (Hex) in all bytes of register (SET1 R2, H/33/) 

3. Add 33 (Hex) to selected bytes of first BCD number (BADD DB, R2, R1) 

4. Add 33 (Hex) to selected bytes of second BCD number (BADD DB, R2, R3) 

5. Add selected bytes of registers 1 and 3 (BADD, R1, R3, R3) 

6. Correct the result (EX3BC, R3, R3) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




Destination Selects 














Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 






OEY3- 






CF2- 


SI03- 


IESI03- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO 


OEA 


OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIO0 


1111 0010 


00 0010 


XX XXXX 


0 XX 


00 0010 


0 


0000 


10 


X 


X 


XXXX 


0 


1 


110 


XXXX 


XXXX 


0000 1000 


00 0010 


XX XXXX 


0 XX 


00 0010 


0 


0000 


10 


X 


X 


XXXX 


0 


X 


110 


XXXX 


XXXX 


1000 1000 


00 0010 


XX XXXX 


0 10 


00 0001 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1100 


0000 


1000 1000 


00 0010 


XX XXXX 


0 10 


00 001 1 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1100 


0000 


1000 1000 


00 0001 


00 001 1 


0 00 


00 001 1 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1100 


0000 


1000 1111 


XX xxxx 


00 001 1 


X 00 


00 001 1 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1100 


0000 



CM 
CO 
00 
00 

o 
< 

z 

CO 



Assume DB bus holds 51336912 at third instruction and 34867162 at fourth 
instruction. 



1 oooo oooo oooo oooo oooo oooo oooo oooo""| RF(2) 0 



2 oooo oooo oooo oooo ooi 1 oo 11 ooi i 001 1 | RF{2) *- 00003333 (Hex) 



3 0101 0001 0011 0011 1001 hop Qi 00 01 oi""| RF(1) RF(2) +DB 



4 0011 0100 10000110 10100100 1001 0101 RF{3) RF(2) + DB 



5 | 0011 0100 1000 0110 0100 0000 1101 ioicTj RF(3)n RF(1 )n + RF(3)n 



6 [ oon 0100 1000 0110 0100 0000 0111 oioo RF(3)n *~ Corrected RF(3)n result 
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Excess-3 Word Correction 



EX3C 



FUNCTION 

Corrects the result of excess-3 addition or subtraction. 
DESCRIPTION 

This instruction corrects excess-3 additions or subtractions in the word mode. For 
correct excess-3 arithmetic, this instruction must follow each add or subtract. The 
operand must be on the S bus. 

Data on the S bus is added to a constant on the R bus determined by the state of 
the BCD flip-flops and previous overflow condition reported on the SSF pin. 



Available R Bus Source Operands 



Z 

> 

o 

H 
00 
00 
CO 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


No 


No 





Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


No 


No 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 




sioo 


No 


Inactive 




SIOT 


No 


Inactive 






No 


Inactive 




SI03 


No 


Inactive 




Cn 


No 


Inactive 
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Status Signals 



ZERO 
N 

OVR 
Cn 



= 0 



1 if MSB = 1 

1 if arithmetic signed overflow 
1 if carry-out = 1 



EXAMPLE (assumes a 32-bit configuration) 

Add two BCD numbers and store the sum in register 3. Assume data comes in on 
DA bus. 

1 . Clear accumulator (SUB ACC, ACC) 

2. Store 33 (Hex) in all bytes of register (SET1 R2, H/33/) 

3. Add 33 (Hex) to all bytes of first BCD number (ADD DB, R2, R1) 

4. Add 33 <Hex) to all bytes of second BCD number (ADD DB, R2, R3) 

5. Add the excess-3 data (ADD, R1, R3, R3) 

6. Correct the excess-3 result (EX3C, R3, R3) 

7. Subtract the excess-3 bias to go to BCD result. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 






0EY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO 


OEA 


OEB 


OEYO 


OES 


Cn 


CFO 


1111 0010 


00 0010 


xx xxxx 


0 XX 


00 0010 


0 


0000 


10 


X 


X 


XXXX 


0 


1 


110 


0000 1000 


00 0010 


XX XXXX 


0 XX 


00 0010 


0 


0000 


10 


X 


X 


XXXX 


0 


X 


110 


1111 0001 


00 0010 


XX XXXX 


0 10 


00 0001 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1111 0001 


00 0010 


XX XXXX 


0 10 


00 001 1 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1111 0001 


00 0001 


00 001 1 


0 00 


00 001 1 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1001 1111 


XX xxxx 


00 0011 


X 00 


00 001 1 


0 


0000 


10 


X 


X 


XXXX 


0 


0 


110 


1111 0010 


00 0010 


00 001 1 


0 00 


00 0011 


0 


0000 


10 


X 


X 


xxxx 


0 


0 


110 



CNJ 
CO 
00 
00 
H 

o 
< 

2 

CO 
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Excess-3 Word Correction 



EX3C 



Assume DB bus holds 51336912 at third instruction and 34867162 at fourth 
instruction. 



Results of Instruction Cycles: 
1 



0000 0000 0000 0000 0000 0000 0000 0000 



RF(2) - 0 



2 00110011 0011 0011 0011 0011 00T100U RF(2) *- 33333333 (Hex) 



3 10000100011001101001110001000101 RF(1)«-RF(2) +DB 



z 

> 

o 

H 

00 
00 
Ca> 

ro 



4 01100111 1011 1001 1010 010010010101 RF{3) *- RF(2) + DB 



5 1110 1100 0010 0000 0100 0000 1101 1010 RF(3)*-RF(1) + RF(3) 



6 1011 1001 0101 0011 0111 0011 10100111 RF(3) Corrected RF(3| result 



7 10000110001000000100000001110100 RF{3) *- RF(3) — RF(2) 
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INCNR 



Increment Negative R using Carry (R' + Gn) 



FUNCTION 

Evaluates R' + Cn. 

DESCRIPTION 

Data on the R bus is inverted and added with carry. The result appears at the ALU 
and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 



RF 
(A5-A0) 


A3-AO 
Immed 


DA-Port 


C3-C0 

A3-A0 
Mask 






Yes 


No 


Yes 


No 


Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 




No 


No 


No 


Available Destination Operands 


RF 
(C5-C0) 


RF 
(B5-BO) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 




Yes 


No 


Yes 


Yes 


Yes 


Control/Data Signals 








Signal 


User 
Programmable 


Use 


SSF 


No 




Affeet shift instructions programmed in bits I7-I4 of 


SlOO 


No 




instruction field. 




§IoT 


No 










Si02 


No 










SI03 


No 










Cn 


Yes 




Increments if programmed high. 



CN 
00 
00 
00 

h- 
o 
< 

z 

CO 
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Increment Negative R using Garry (R' + Cn) 



INCNR 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



tc is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Convert the data on the DA bus to two's complement and store the result in register 4. 



2 

> 

O 

QO Assume register file 1 holds 3791 FEF6 (Hex): 

00 
CO 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


GFO 


1111 0111 


XX XXXX 


XX XXXX 


1 XX 


00 0100 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Source 001 10111 1001 0001 1 1 1 1 1 1 10 1 1 1 1 01 10 



DA 



Destination 1100 1000 0110 1110 0000 0001 0000 1010 RF(4) *- R' + Cn 
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Increment Negative S using Carry (S' + Cn) 



FUNCTION 

Evaluates S' + Cn. 

DESCRIPTION 

Data on the S bus is inverted and added to the carry. The result appears at the ALU 
and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-AO) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 




No 


No 


No 


No 




Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 



CNI 
CO 
00 
00 
h- 
O 
< 

z 
c/) 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


SIOO 


No 


instruction field. 


5ioT 


No 




Si02 


No 




SI03 


No 




Cn 


Yes 


Increments if programmed high. 
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1*15 I Increment Negative S using Carry (S' + Cn) INCNS 



Status Signals * 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C =1 if carry-out =1 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



EXAMPLE (assumes a 32-bit configuration) 

Convert the data on the MQ register to one's complement and store the result in 
register 4. 

O) 
2 

> 
O 
H 

00 Assume MQ register file 1 holds 3791 FEF6 (Hex): 
00 

CO , : : , 

N) Source 00110111100100011111111011110110 S 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EB0 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 0101 


XX XXXX 


XX XXXX 


X 11 


00 0100 


0 


0000 


10 X X 


XXXX 


0 


0 


110 



MQ register 



Destination 1 100 1000 01 10 1 1 10 0000 0001 0000 1001 RF(4) S' + Cn 
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INCR 



Increment R using Carry (R + Cn) 



FTn 



FUNCTION 

Increments R if the carry is set. 
DESCRIPTION 

Data on the R bus is added to the carry. The sum appears at the ALU and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 1 5. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 




Yes 


No 


Yes 


No 




Available S Bus Source 






Operands (MSH) 








RF 
(B5-B0) 


DB-Port 


MQ 
Register 






No 


No 


No 






Available Destination Operands 




RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 



CM 
CO 
00 
00 
h- 
O 
< 

z 

CO 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


SIOO 


No 


instruction field. 


SIOT 


No 




SI02 


No 




Si03 


No 




Cn 


Yes 


Increments R if programmed high. 
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IE 



Increment R using Carry (R + Cn) 



INCR 



Status Signals* 



ZERO = 1 if result = 0 

N =1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

Cn = 1 if carry-out = 1 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated after shift 
operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Increment the data on the DA bus and store the result in register 4. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


0EY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 0110 


XX XXXX 


XX XXXX 


1 XX 


00 0100 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



2 



> Assume register file 1 holds 3791 FEF6 (Hex). 
O 



00 

00 Source 
CO 



0001 0111 1001 0001 1111 1110 1111 0110 



R +- DA 



Destination 0001 0111 1001 0001 1111 111011110111 RF(4) R + Cn 
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INCS 



Increment S using Carry (S + Cn) 



izn 



FUNCTION 

Increments S if the carry is set. 
DESCRIPTION 

Data on the S bus is added to the carry. The sum appears at the ALU and MQ shifters. 



*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 




No 


No 


No 


No 




Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-G0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 




CM 
CO 
00 
00 
h- 

o 
< 

z 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


SIOO 


No 


instruction field. 


sioT 


No 




Sl02 


No 




Sl03 


No 




Cn 


Yes 


Increments S if programmed high. 
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±2 



Increment S using Carry (S + Cn) 



INCS 



Status Signals* 



2 

> 

O 
H 
00 
00 
GO 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out =1 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Increment the data in the MQ register and store the result in register 4. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 0100 


XX XXXX 


XX XXXX 


X 11 


00 0100 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Assume MQ register holds 54FF00FF (Hex): 
Source 



0101 0100 1 1 1 1 1111 0000 0000 1111 1111 



S *- MQ register 



Destination 0101 0100 1111 1111 00000001 00000000 RF(4) *-S + Cn 
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LOADFF 



Load Divide/BCD Flip Flops 



cm 



FUNCTION 

Load divide/BCD flip-flops from external data input. 
DESCRIPTION 

Uses an internal bypass path to load data from the S MUX directly into the divide/BCD 
flip-flops. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 




No 


No 


No 


No 




Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


No 


No 


No 


No 


No 



CM 
00 
00 
00 

O 
< 

z 



Control/Data Signals 



Signal 



User 
Programmable 



Use 



SSF 

sioo 

Sf5T 
SI02 
SI03 
Cn 



No 
No 
No 
No 
No 
No 



Inactive 
Inactive 
Inactive 
Inactive, 
Inactive 
Inactive 
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rrm 



load Divide/BCD Flip Flops 



LOADFF 



Status Signals 




EXAMPLE (assumes a 32-bit configuration) 

Load the divide/BCD flip-flops with data from the DB input bus. 



Instr 
Code 
17-10 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sel 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 


0000 1111 


XX XXXX 


XX XXXX 


X 10 


XX XXXX 


X XXXX XX X X XXXX X 


X 


110 



Assume DB input holds 2A08C6 18 (Hex): 

> 

O Source 



0010 1010 0000 1000 1 100 01 10 0001 1000 



00 

Destination 



0010 1010 0000 1000 1 100 01 10 0001 1000 



S *- DB bus 

Divide/BCD flip-flops *- S 
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LOADMQ Pass (Y F) and Load MQ with F 



FUNCTION 

Passes the result of the ALU instruction specified in the lower nibble of the instruction 
field to Y and the MQ register. 

DESCRIPTION 

The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10) is passed unshifted to Y and the MQ register. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 



Shift Operations 



ALU Shifter 


MQ Shifter 


None 


None 



Available Destination Operands 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



CM 
CO 
00 
00 
h- 
O 
< 

Z 
CO 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Outputs MQO (LSB) 


SlOS' 


No 


Inactive 


sioT 


No 


Inactive 


Si02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


No 


Inactive 



Status Signals 1 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 
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Pass (Y «- F) and Load MQ with F 



LOADMQ 



EXAMPLE (assumes a 32-bit configuration) 

Load the MQ register with data from register 1, and pass the data to the Y port. 
(In this example, data is passed to the ALU by and INCR instruction without carry-in.) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 0110 


00 0001 


XX XXXX 


0 XX 


XX XXXX 


0 


XXXX 


XX X X 


XXXX 


0 


0 


110 



0) 
vj 

> 

O 
H 
00 
00 
CO 



Assume register file 1 holds 2A08C618 (Hex) 
Source 



Destination 



0010 1010 0000 1000 1100 0110 0001 1000 | R«-RF(1) 

MQ register *- R "+ Cn 



0010 1010 0000 1000 1 100 01 10 0001 1000 
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MQSLC 



Pass (Y +- F) with Circular Left MQ Shift 



H 



FUNCTION 

Passes the result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs a circular left shift on MQ. 

DESCRIPTION 

The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10) is passed unshifted to Y MUX. 

The contents of the MQ register are rotated one bit to the left. The MSB is rotated 
out and passed to the LSB of the same word, which may be 1 , 2, or 4 bytes long. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 1 5. 
Shift Operations 



ALU Shifter 


MQ Shifter 


None 


Circular Left 



Available Destination Operands (ALU Shifter) 



CM 
CO 
00 
00 

O 
< 

2 

C/) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high or floating; retains MQ 
without shift if low. 


SlOO 


No 




Inactive 


SloT 


No 




Inactive 


Si02 


No 




Inactive 


SI03 


No 




Inactive 


Cn 


No 




Affects arithmetic operation programmed in bits 
I3-I0 of instruction field. 
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LDJLJLJ Pass (Y ^ F) with Circular Left MQ Shift MQSLC 



Status Signals T 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



EXAMPLE (assumes a 32-bit configuration) 

Add data in register 1 to data on the DB bus with carry-in and store the unshifted 
(/> result in register 1 . Circular shift the contents of the MQ register one bit to the left. 
2 

> 
O 



Instr 


Oprd 


Oprd 


Oprd Sei 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


1a ebo 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1 101 0001 


00 0001 


XX XXXX 


0 10 


00 0001 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



00 
00 

to Assume register file 1 holds 2508C618 (Hex), DB bus holds 1 1007530 (Hex), and 
MQ register holds 4DA99A0E (Hex). 



Source 0010 010.1 0000 1000 1100 0110 0001 1000 R *- RF(1) 



Source 0001 0001 0000 0000 01 11 0101 001 1 0000 S *- DB bus 



Destination 0011 01100000 1001 0011 1011 0100 1001 RF(1) *- R + S + Cn 



Source 0100 1101 1010 1001 1001 10100000 1110 MQ shifter *- MQ register 



Destination 1001 1011 0101 0011 0011 01000001 1100 MQ register *- MQ shifter 
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MQSLL 



Pass (Y *- F) with Logical Left MQ Shift 



FUNCTION 

Passes the result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs a left shift on MQ. 



DESCRIPTION 

The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10) is passed unshifted to Y MUX. 

The contents of the MQ register are shifted one bit to the left. A zero is filled into 
the least significant bit of each word unless the SIO input for that word is programmed 
low; this will force the least significant bit to one. The MSB is dropped from each word, 
which may be 1, 2, or 4 bytes long, depending on the configuration selected. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result csl 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. £g 

*A list of ALU operations that can be used with this instruction is given in Table 15. £2 

o 
< 

•t< 
z 



Shift Operations 



ALU Shifter 


MQ Shifter 


None 


Logical Left 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high or floating; retains MQ 
without shift if low. 


SlSO 


Yes 




Fills a zero in LSB of MQ shifter if high or floating; 
sets LSB to one if low. 


SIOT 


No 




Inactive in 32-bit configuration; used in 


SI02 


No 




configurations to select end-fill in LSBs. 


5IS3 


No 






Cn 


No 




Affects arithmetic operation programmed in bits 
I3-I0 of instruction field. 
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Pass (Y <- F) with Logical Left MQ Shift 



MQSLL 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



"•"C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



2 

> 

O 
H 
00 
00 
CO 



EXAMPLE (assumes a 32-bit configuration) 

Add data in register 7 to data on the DB bus with carry-in and store the unshifted 
result in register 7. Shift the contents of the MQ register one bit to the left, filling 
a zero into the least significant bit. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 


Destination Selects 














Code 


Addr 


Addr 


EB1- 


Addr 


WE3- SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIO0 


1100 0001 


00 01 1 1 


XX XXXX 


0 10 


00 0111 


0 0000 10 X X 


XXXX 


0 


1 


110 


1111 


0000 



Assume register file 7 holds 7308C618 (Hex), DB bus holds 54007530 (Hex), and 
MQ register holds 61 A99A0E (Hex). 



Source 01 110011 0000 1000 1 100 01 10 0001 1000 



RF(7) 



Source 0101 0100 0000 0000 01 1 1 0101 001 1 0000 S *- DB bus 



Destination 11000111 0000 1001 0011 1011 0100 1001 RF(7) *- R + S + Cn 



Source 0110 0001 1010 1001 1001 1010 0000 1100 MQ shifter *- MQ register 



Destination 1 100 001 1 0101 001 1 001 1 0100 0001 1000 MQ register *- MQ shifter 
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MQSRA Pass (Y - F) with Arithmetic Right MQ Shift ULLl 



FUNCTION 



Passes the result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs an arithmetic right shift on MQ. 

DESCRIPTION 

The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10) is passed unshifted to Y MUX. 

The contents of the MQ register are rotated one bit to the right. The sign bit of the 
most significant byte is retained. Bit 0 of the least significant byte is dropped. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 1 5. 
Shift Operations 



ALU Shifter 


MQ Shifter 


None 


Arithmetic Right 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high or floating; retains MQ 
without shift if low. 


SiOO 


No 




Outputs LSB of MQ shifter (inverted). 


SlST 


No 




Inactive in 32-bit configurations; used in other 


5io5 


No 




configurations to output LSBs from MQ shifter 


5T05 


No 




(inverted). 


Cn 


No 




Affects arithmetic operation programmed in bits 
I3-I0 of instruction field. 



CM 
CO 
00 
00 
h- 

o 
< 

z 

CO 
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nr 



Pass (Y F) with Arithmetic Right MQ Shift 



MQSRA 



Status Signals * 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



z 

> 
o 

H 

00 
00 
Ca> 



EXAMPLE (assumes a 32-bit configuration) 

Add data in register 1 to data in register 10 with carry-in and store the unshif ted result 
in register 1 . Shift the contents of the MQ register one bit to the right, retaining the 
sign bit. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


0EY3- 






CF2- 


I7-IO 


A5-AO 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Gn 


CFO 


1010 0001 


00 0001 


00 1010 


0 00 


00 0001 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Assume register file 1 holds 5608C61 8 (H 
and MQ register holds 98A99A0E (Hex). 



Source 0101 01100000 1000 110001100001 1000 



RF(1) 



Source 0001 0100 0000 0000 01 1 1 0101 001 1 0000 S «- RF(1 0) 



Destination 0110 10100000 1001 0011 1011 0100 1001 RF(1) *- R + S + Cn 



Source 1001 1000 1010 1001 1001 10100000 1110 MQ shifter «- MQ register 



Destination 1100 1100 0101 0100 1100 1101 0000 0111 MQ register *- MQ shifter 
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MQSRL 



Pass (Y <- F) with Logical Right MQ Shift 



FUNCTION 

Passes the result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs a right shift on MQ. 



DESCRIPTION 



The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10) is passed unshifted to Y MUX. 

The contents of the MQ register are shifted one bit to t he right. A zero is placed in 
the sign bit of the most significant byte unless the SIO input for that byte is set to 
zero; this will force the sign bit to 1. Bit 0 of the least significant byte is dropped. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 1 5. 
Shift Operations 




ALU Shifter 


MQ Shifter 


None 


Logical Right 



CM 
CO 
00 
00 
H 

o 
< 

2 
(/) 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high or floating; retains MQ 
without shift if low. 


5150 


Yes 




Fills a zero in LSB of MQ shifter if high or floating; 
sets LSB to one if low. 


siST 


No 




Inactive in 32-bit configuration; used in other 


SI02 


No 




configurations to select end-fill in LSBs. 


SI03 


No 






Cn 


No 




Affects arithmetic operation programmed in bits 
I3-I0 of instruction field. 
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eh 



Pass (Y *- F) with Logical Right MQ Shift 



MQSRL 



Status Signals 1 * 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

- 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out = 1 



Tc is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



CO 
2 

> 

O 
H 
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EXAMPLE (assumes a 32-bit configuration) 

Add data in register 1 to data on the DB bus with carry-in and store the unshifted 
result in register 1 . Shift the contents of the MQ register one bit to the left. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


0EY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1011 0001 


00 0001 


XX XXXX 


0 10 


00 0001 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Assume register file 1 holds 5608C618 (Hex), DB bus holds 14007530 (Hex), and 
MQ register holds 98A99A0E (Hex). 



Source 0101 01100000 1000 1100 01100001 1000 R RF(1) 



Source 0001 0100 0000 0000 01 11 0101 001 1 0000 S *- DB bus 



Destination 0110 10100000 1001 0011 1011 0100 1001 RF(1) R + S + Cn 



Source 1001 1000 1010 1001 1001 1010 0000 1110 MQ shifter *- MQ register 



Destination 0100 1 100 0101 0100 1 100 1 101 0000 01 1 1 MQ register *- MQ shifter 
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NAND 



Logical NAND (R WAND S) 



FUNCTION 

Evaluates the logical expression R NAND S. 



DESCRIPTION 

Data on the R bus is NANDed with data on the S bus. The result appears at the ALU 
and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
00 
00 
00 
H- 
O 
< 

I s * 
(/> 



Available Destination Operands 



RF 

(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


SfOO 


No 


instruction field. 


sioT 


No 




SI02 


No 




SIU3 


No 




Cn 




Inactive 
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TjT 



Logical HAND (R HAND S) 



NAND 



Status Signals t 



ZERO 
N 

OVR 
C 



1 if result = 0 
1 if MSB = 1 
0 
0 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



EXAMPLE (assumes a 32-bit configuration) 

Logically NAND the contents of register 3 and register 5, and store the result 
in register 5. 

cn 
z 

> 

o 

H 

00 Assume register file 1 holds 60F6D840 (Hex) and register file 5 holds 1 3F6D377 (Hex). 
00 

CO , — , 

N) Source 01100000111101101101100001000000 R *- RF(3) 



Instr 


Oprd 


Oprd 


Oprd Set 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 1100 


00 0011 


00 0101 


0 00 


00 0101 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



Source 0001 0011 1111 0110 1101 0011 0111 0111 



RF(5) 



Destination 11111111 0000 1001 0010 1111 1011 1111 RF(5) R NAND S 
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NOP 



No Operation 



czu 



FUNCTION 

Forces ALU output to zero. 
DESCRIPTION 

This instruction forces the ALU output to zero. The BCD flip-flops retain their old value. 
Note that the clear instruction (CLR) forces the ALU output to zero and clears the BCD 
flip-flops. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




No 


No 


No 





CM 
CO 
00 
00 

I- 
o 
< 

z 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Status Signals 
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F F 



Mo Operation 



NOP 



EXAMPLE (assumes a 32-bit configuration) 
Clear register 12. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


0EY3- 






CF2- 


I7-IO 


A5-A0 


B5-BO 


EA EBO 


C5-CO 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 1111 


XX XXXX 


XX XXXX 


X XX 


00 1100 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



Destination oooo oooo oooo oooo oooo oooo oooo oooo RF( 1 2) *- 0 



3-122 



NOR 



Logical NOR (R NOR S) 



FUNCTION 

Evaluates the logical expression R NOR S. 
DESCRIPTION 

Data on the R bus is NORed with data on the S bus. The result appears at the ALU 
and MQ shifters. 



*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-AO) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 




Yes 


No 


Yes 


No 




Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 




CM 
CO 
00 
00 
h- 
O 
< 

z 

(A 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


§ioo 


No 


instruction field. 


SlST 


No 




siol 


No 




SI03 


No 




Cn 


No 


Inactive 
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cm: 



Logical MOB (R NOR S) 



NOR 



Status Signals 1, 



ZERO 
N 

OVR 
C 



1 if result = 0 
1 if MSB = 1 
0 
0 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



CO 
2 
vj 

> 

o 

H 

00 
00 
CO 



EXAMPLE (assumes a 32-bit configuration) 

Logically NOR the contents of register 3 and register 5, and store the result 
in register 5. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 1011 


00 0011 


000101 


0 00 


00 0101 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



Assume register file 3 holds 60F6D840 (Hex) and register file 5 holds 1 3F6D377 (Hex). 



Source 01100000111101101101100001000000 R RF(3) 



Source 00010011111101101101001101110111 S RF(5) 



Destination 1000 1100 0000 1001 0010 0100 1000 1000 RF(5) «- R NOR S 
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OR 



Logical OR (R OR S) 



FUNCTION 

Evaluates the logical expression R OR S. 
DESCRIPTION 

Data on the R bus is ORed with data on the S bus. The result appears at the ALU 
and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-AO) 


A3-AO 
Immed 


DA-Port 


A3-A0 
Mask 




Yes 


No 


Yes 


No 




Available S Bus Source Operands 




RF 
(B5-BO) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-C0) 


RF 
(B5-BO) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 



CM 
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00 
h- 
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Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


sloo 


No 


instruction field. 


SlOT 


No 




SI02 


No 




SI03 


No 




Cn 


No 


Inactive 
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CUT 



logical OR (R OR S) 



OR 



Status Signals * 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR =0 

C = 0 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Logically OR the contents of register 5 and register 3, and store the result in 
' register s. 

CO 

z 

> 
O 
H 

00 Assume register file 5 holds 60F6D840 (Hex) and register file 3 holds 1 3F6D377 (Hex). 
00 

(a) i : : , 

N) Source 01100000111101101101100001000000 R *- RF(5) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Oest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-IO 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Gn 


CFO 


1111 1011 


00 0101 


00 001 1 


0 00 


00 001 1 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



Source 0001 0011 1111 0110 1101 0011 0111 0111 S 1- RF(3) 



Destination 01110011111101101101101101110111 RF(3) «- R OR S 



3-1 26 



PASS 



Pass (Y «- F) 



EE 



FUNCTION 

Passes the result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. 

DESCRIPTION 

The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10) is passed unshifted to Y MUX. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 



Available Destination Operands 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 

Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sioo 


No 


Inactive 


sioT 


No 


Inactive 


SI02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


No 


Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 



CN 
CO 
CO 
00 
h- 

o 
< 

z 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out condition 



*C is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 
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rrm 



Pass (Y •*- F) 



PASS 



EXAMPLE (assumes a 32-bit configuration) 

Add data in register 1 to data on the DB bus with carry-in and store the unshifted 
result in register 10. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 0001 


00 0001 


XX XXXX 


0 10 


00 1010 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Assume register file 3 holds 9308C618 (Hex) and DB bus holds 24007530 (Hex). 



Source 1001 001 1 0000 1000 1 100 01 10 0001 1000 R *- RF(1 ) 
Source 



0010 0100 0000 0000 01 1 1 0101 001 1 0000 



^4 

^ Destination 

O 
H 
00 
00 
CO 



1011 0111 0000 1001 0011 1011 0100 1001 



S *- DB bus 



RF(10) - R + S + Cn 
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SDIVI 



Signed Divide Iterate 



ATT 



FUNCTION 

Performs one of N-2 iterations of nonrestoring signed division by a test subtraction 
of the N-bit divisor from the 2N-bit dividend. An algorithm using this instruction is 
given in the "Other Arithmetic Instructions" section. 

DESCRIPTION 

SDIVI performs a test subtraction of the divisor from the dividend to generate a quotient 
bit. The test subtraction passes if the remainder is positive and fails if negative. If 
it fails, the remainder will be corrected during the next instruction. 

SDIVI checks the pass/fail result of the test subtraction from the previous instruction, 
and evaluates 



F - R + S 

F - R' + S + Cn 



if the test fails 
if the test passes 



A double precision left shift is performed; bit 7 of the most significant byte of the MQ 
shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 of 
the most significant byte of the ALU shifter is lost. The unfixed quotient bit is circulated 
into the least significant bit of the MQ shifter. 

The R bus must be loaded with the divisor, the S bus with the most significant half 
of the result of the previous instruction (SDIVI during iteration or SDIVIS at the beginning 
of iteration). The least significant half of the previous result is in the MQ register. Carry- 
in should be programmed high. Overflow occurring during SDIVI is reported to OVR 
at the end of the signed divide routine (after SDIVQF). 



CNJ 
CO 
00 
00 
K 
O 
< 

z 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Recommended S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


No 




Recommended Destination Operands 


RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 





ALU 


MQ 


Left 


Left 
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Signed Divide Iterate 



SDIVI 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sTOo 


No 


Pass internally generated end-fill bits. 


§ioT 


No 




§102 


No 




sio5 


No 




Cn 


Yes 


Should be programmed high 



Status Signals 



0) 
2 

•si 

> 

O 
H 
00 
00 
CO 



ZERO = 1 if intermediate result 

N = 0 

OVR = 0 

C = 1 if carry-out 
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SDIVIN 



Signed Divide Initialize 



ittti 



FUNCTION 

Initializes 'ACT8832 for nonrestoring signed division by shifting the dividend left and 
internally preserving the sign bit. An algorithm using this instruction is given in the 
"Other Arithmetic Instructions section. 



DESCRIPTION 

This instruction prepares for signed divide iteration operations by shifting the dividend 
and storing the sign for future use. 

The preceding instruction should load the MQ reqister with the least significant half 
of the dividend. During SDIVIN, the S bus should be loaded with the most significant 
half of the dividend, and the R bus with the divisor. Y-output should be written back 
to the register file for use in the next instruction. 

A double precision logical left shift is performed; bit 7 of the most significant byte 
of the MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. 
Bit 7 of the most significant byte of the ALU shifter is lost. The unfixed quotient sign 
bit is shifted into the least significant bit of the MQ shifter. 

Available R Bus Source Operands 



RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


C3-C0 

A3-A0 
Mask 


Yes 


No 


Yes 


No 



CN 
CO 
00 
00 

h- 
o 
< 

Z 
C/> 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-BO) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Left 


Left 
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J_|_0J Signed Divide Initialize SDiVIH 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIO0 


No 


Pass internally generated end-fill bits. 


sioT 


No 




SI02 


No 




Si03 


No 




Cn 


No 


Inactive 


Status Signals 


ZERO = 


1 if divisor = 0 




N = 


0 




OVR = 


0 




Cn = 


0 
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SDIVIS 



Signed Divide Start 



0 



FUNCTION 

Computes the first quotient bit of nonrestoring signed division. An algorithm using 
this instruction is given in the ' 'Other Arithmetic Instructions" section.. 

DESCRIPTION 

SDIVIS computes the first quotient bit during nonrestoring signed division by subtracting 
the divisor from the dividend, which was left-shifted during the prior SDIVIN instruction. 
The resulting remainder due to subtraction may be negative. If so, the subsequent 
SDIVI instruction will restore the remainder during the next subtraction. 

The R bus must be loaded with the divisor and the S bus with the most significant 
half of the remainder. The result on the Y bus should be loaded back into the register 
file for use in the next instruction. The least significant half of the remainder is in the 
MQ register. Garry-in should be programmed high. 

A double precision left shift is performed; bit 7 of the most significant byte of the 
MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 H 
of the most significant byte of the ALU shifter is lost. The unfixed quotient bit is ^2 
circulated into the least significant bit of the MQ shifter. ^ 

Overflow occurring during SDIVIS is reported to OVR at the end of the signed division 2 
routine (after SDIVQF). (/) 



Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 








DA-Port 




(A5-A0) 


Immed 




A3-A0 








Mask 


Yes 


No 


Yes 


No 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Left 


Left 
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9 0 



Signed Divide Start 



SDIVIS 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIOO 


No 


Pass internally generated end-fill bits. 


sioT 


No 




Sf02 


No 




SI03 


No 




Cn 


Yes 


Should be programmed high. 



Status Signals 



ZERO =1 if intermediate result 

N = 0 

OVR =0 

C = 1 if carry-out 
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SDIVIT 



Signed Divide Terminate 



cm 



FUNCTION 

Solves the final quotient bit during nonrestoring signed division. An 
algorithm using this instruction is given in the "Other Arithmetic Instructions" section. 

DESCRIPTION 

SDIVIT performs the final subtraction of the divisor from the remainder during 
nonrestoring signed division. SDIVIT is preceded by N-2 iterations of SDIVI, where 
N is the number of bits in the dividend. 

The R bus must be loaded with the divisor, and the S bus must be loaded with the 
most significant half of the result of the last SDIVI instruction. The least significant 
half lies in the MQ register. The Y bus result must be loaded back into the register 
file for use in the subsequent DIVRF instruction. Carry-in should be programmed high. 



CM 



SDIVIT checks the pass/fail result of the previous instruction's test subtraction and 
evaluates; 00 

H 

Y *- R + -S if the test fails O 

Y R' + S + Cn if the test passes <C 

The contents of the MQ register are shifted one bit to the left; the unfixed quotient 
bit is circulated into the least significant bit. </) 

Overflow during this instruction is reported to OVR at the end of the signed division 
routine (after SDIVQF). 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-AO 
Mask 


Yes 


No 


Yes 


No 


Recommended S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


No 




Recommended Destination Operands 


RF 
(C5-C0) 


RF 
(B5-BO) 


Y-Port 




Yes 


No 


Yes 





ALU 


MQ 


Left 


Left 
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E 0 



Signed Divide Terminate 



SDIVIT 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


3155 


No 


Pass internally generated end-fill bits. 


SioT 


No 




ST52 


No 




SI03 


No 




Cn 


Yes 


Should be programmed high 



Status Signals 



ZERO = 1 if intermediate result 

N = 0 

OVR =0 

C = 1 if carry-out 
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SDIVO 



Signed Divide Overflow Test 



FUNCTION 

Tests for overflow during nonrestoring signed division. An algorithm using this 
instruction is given in the ' 'Other Arithmetic Instructions section. 



DESCRIPTION 



This instruction performs an initial test subtraction of the divisor from the dividend. 
If overflow is detected, it is preserved internally and reported at the end of the divide 
routine (after SDIVQF). If overflow status is ignored, the SDIVO instruction may be 
omitted. 

The divisor must be loaded onto the R bus; the most significant half of the previous 
SDIVIN result must be loaded onto the S bus. The least significant half is in the MQ 
register. 

The result on the Y bus should not be stored back into the register file; WE' should 
be programmed high. 

Carry-in should also be programmed high. 
Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 










DA-Port 




(A5-A0) 


Immed 




A3-A0 








Mask 


Yes 


No 


Yes 


No 



CM 
CO 
00 
00 
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o 
< 

2 
(/) 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 
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Signed Divide Overflow Test 



SDIVO 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


§ISo 


No 


Inactive 


SlOT 


No 


Inactive 


SI02 


No 


Inactive 


5103 


No 


Inactive 


Cn 


Yes 


Should be programmed high 


Status Signals 


ZERO = 


1 if divisor = 0 




N = 


0 




OVR = 


0 




C = 


1 if carry-out 





cn 

2 

> 
O 
H 
00 
00 
O) 
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SDIVQF 



Signed Divide Quotient Fix 



FUNCTION 

Tests the quotient result after nonrestoring signed division and corrects it if necessary. 
An algorithm using this instruction is given in the "Other Arithmetic Instructions" 
section. 



DESCRIPTION 

SDIVQF is the final instruction required to compute the quotient of a 2N-bit dividend 
by an N-bit divisor. It corrects the quotient if the signs of the divisor and dividend are 
different and the remainder is nonzero. 

The fix is implemented by incrementing S: 

Y S + 1 if a fix is required 

Y *- S + 0 if no fix is required 

The R bus must be loaded with the divisor, and the S bus with the most significant 
half of the result of the preceding DIVRF instruction. The least significant half is in 
the MQ register. 

Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 










DA-Port 




(A5-A0) 


Immed 


A3-A0 








Mask 


Yes 


No 


Yes 


No 



CM 
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00 
00 
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< 
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C0 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 
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he 



Signed Divide Quotient Fix 



SDIVQF 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIOO 


No 


Inactive 


sioT 


No 


Inactive 




No 


Inactive 


3103 


No 


Inactive 


Gn 


Yes 


Should be programmed high 



Status Signals 



ZERO = 1 if quotient = 0 

N = 1 if sign of quotient + 1 

= 0 if sign of quotient + 0 

OVR = 1 if divide overflow 

C = 1 if carry-out 
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SEL Select S/R 1 11 0 



FUNCTION 

Selects S if SSF is high; otherwise selects R. 
DESCRIPTION 

Data on the S bus is passed to Y if SSF is programmed high or floating; data on the 
R bus is passed without carry to Y if SSF is programmed low. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Available S Bus Source 
Operands (MSH) 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 




Available Destination Operands 


RF 

(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 





Shift Operations 



ALU 


MQ 


None 


None 
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Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


Yes 


Selects S if high, R if low. 


SIO0 


No 


Inactive 


ST5T 


No 


Inactive 


SI02 


No 


Inactive 


SIS3 


No 


Inactive 


Cn 


No 


Inactive 
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rrm 



Select S/R 



SEL 



Status Signals 



ZERO 
N 

OVR 
C 



1 if result = 0 
1 if MSB = 1 
0 
0 




z 

> 

o 

H 
00 
00 
W 



EXAMPLE (assumes a 32-bit configuration) 

Compare the two's complement numbers in registers 1 and 3 and store the larger in 
register 5. 

1 . Subtract (SUBS) data in register 3 from data in register 1 and pass the result 
to the Y bus. 

2. Perform Select S/R instruction and pass result to register 5. 

[This example assumes the SSF is set by the negative status (N) from the previous 
instruction]. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1 1 1 1 001 1 


00 0001 


00 001 1 


0 00 


XX XXXX 


0 


XXXX 


XX X X 


0000 


0 


1 


110 


0001 0000 


00 0001 


00 001 1 


0 00 


00 0101 


0 


0000 


10 X X 


XXXX 


0 


0 


110 



Assume register file 1 holds 008497D0 (Hex) and register file 3 holds 01 C35250 (Hex). 
Instruction Cycle 1 



Source oooo oooo 1000 0100 1001 01 1 1 1 101 oooo R *- RF(1 ) 



Source oooo 0001 1 100 001 1 0101 0010 0101 oooo S *- RF(3) 



Destination 1111 1110 11000001 01000101 10000000 Y bus *- R + S' + Cn 



□ N 



Instruction Cycle 2 

Source 



00000000 10000100 1001 0111 noi oooo I R*-RFO) 



pi] SSF- 1 

Source oooo 0001 11000011 0101 00100101 oooo | S RF(3) 

RF(5) - S 



Destination 
3-142 



00000001 11000011 0101 00100101 oooo 



SETO 



Reset Bit 



on 



FUNCTION 

Resets bits in selected bytes of S-bus data using mask in C3-C0::A3-A0. 
DESCRIPTION 

The register addressed by B5-BO is both the source and destination for this instruction. 
The source word is passed on the S bus to the ALU, where it is compared to an 8-bit 
mask, consisting of a concatenation of the C3-G0 and A3-A0 address ports 
(C3-C0::A3-A0). The mask is input via the R bus. All bits in the source word that are 
in the same bit position as ones in the mask are reset. Bytes with their SIO inputs 
programmed low perform the Reset Bit instruction. Bytes with their SIO inputs 
programmed high or floating pass S unaltered. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


Yes 


Available S Bus Source 
Operands (MSH) 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 




Available Destination Operands 


RF 

(C5-C0) 


RF 
(B5-B0) 


Y-Port 




No 


Yes 


Yes 





Shift Operations 



CM 
CO 
00 
00 
H 
O 
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ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sioo 


No 


Byte-select 


sioi 


No 


Byte-select 


§102 


No 


Byte-select 


S?03 


No 


Byte-select 


Cn 


No 


Inactive 
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Reset Bit 



SETO 



Status Signals 



ZERO = 1 if result {selected bytes) 

N = 0 

OVR = 0 

C = 0 



EXAMPLE (assumes a 32-bit configuration) 

Set bits 3-0 of bytes 1 and 2 of register file 8 to zero and store the result back in 
register 8. 



Instr 


Mask 


Oprd 


Oprd Sel 


Mask 






Destination Selects 














Code 


(LSH) 


Addr 


EB1- 


(MSH) 




WE3- 


SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 


17-10 


A3-A0 


B5-B0 


EA EBO 


C3-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIO0 


0001 1000 


1111 


00 1000 


X 00 


0000 


0 


0000 


10 X X 


XXXX 


0 


X 


110 


1001 


0000 



0000 1111 oooo 1111 oodo 1111 0000 1 111 



4* Assume register file 8 holds A083BEBE (Hex). 
> 

^ Source 

00 
00 

10 Source 



1010 0000 1000 0011 1011 1110 1011 1110 



ALU 
Destination 



1010 0000 1000 0000 101 1 0000 101 1 1110 



1010 0000 1 000 0000 1011 0000 1011 1110 



Rn - C3-C0::A3-A0 
Sn *- RF(3)n 
Fn — Sn AND Rn 
RF(8)n - Fn or Sn* 



f F = ALU result 
n = nth byte 

Register file 8 gets F if byte selected, S if byte not selected. 
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SET1 



Set Bit 



nrm 



FUNCTION 

Sets bits in selected bytes of S-bus data using mask in C3-C0::A3-A0. 
DESCRIPTION 

The register addressed by B5-B0 is both the source and destination for this instruction. 
The source word is passed on the S bus to the ALU, where it is compared to an 8-bit 
mask, consisting of a concatenation of the C3-C0 and A3-A0 address ports 
(C3-C0::A3-A0). The mask is input via the R bus. All bits in the source word that are 
in th e sam e bit position as ones in the mask are forced to a logical one. Bytes with 
thei r SIO inputs programmed low perform the Set Bit instruction. Bytes with their 
SIO inputs programmed high or floating pass S unaltered. 

Available R Bus Source Operands 










C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


Yes 


Available S Bus Source 
Operands (MSH) 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 




Available Destination Operands 


RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




No 


Yes 


Yes 





CM 
00 
00 
00 
h- 
O 
< 

z 

CO 



Shift Operations 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 



User 
Programmable 



Use 



SSF 

SIOO 

SIOT 

SI02 

Si03 

Cn 



No 

Yes 

No 

No 

No 

No 



Inactive 

Byte-select 

Byte-select 

Byte-select 

Byte-select 

Inactive 
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TT8 



Status Signals 



Set Bit 



SET1 



ZERO =1 if result (selected bytes) = 0 

N = 0 

OVR = 0 

C = 0 



EXAMPLE (assumes a 32-bit configuration) 

Set bits 3-0 of byte 1 of register file 1 to zero and store the result back in register 1 . 




z 

> 

O 
H 
00 
00 
CO 



Instr 


Mask 


Oprd 


Oprd Sel 


Mask 


Destination Selects 














Code 


(LSH) 


Addr 


EB1- 


(MSH) 


WE3- SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 


I7-I0 


A3-A0 


B5-B0 


EA EBO 


C3-C0 


SELMQ WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIOO 


0000 1000 


1111 


00 0001 


X 00 


0000 


0 0000 10 X X 


XXXX 


0 


X 


110 


1101 


0000 



Assume register file 8 holds A083BEBE (Hex). 
Source 



0000 1111 0000 1111 0000 1111 0000 1111 



Source 
ALU 
Destination 



1010 0000 1000 0011 1011 1110 1011 1110 



1010 0000 1000 0011 1011 1111 1011 1110 



1010 0000 1000 0011 1011 1111 1011 1110 



Rn «- C3-C0::A3-A0 
Sn RF(1)n 
Fn *- Sn OR Rn 
RF(1)n - Fn or Sn 1 " 



T F = ALU result 
n = nth byte 

Register file 1 gets F if byte selected, S if byte not selected. 
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SLA 



Arithmetic Left Single Precision Shift 



FUNCTION 

Performs arithmetic left shift on result of ALU operation specified in lower nibble of 
instruction field. 



DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is shifted one bit 
to t he le ft. A zero is filled into bit 0 of the least significant byte of each word unless 
the SIO input is programmed low; this will force bit 0 to one. Bit 7 is dropped from 
the most significant byte in each word, which may be 1,2, or 4 bytes long, depending 
on the configuration selected. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Arithmetic Left 


None 



Available Destination Operands (ALU Shifter) 



CM 
CO 
00 
00 
f— 

o 
< 

r> 
z 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high; passes ALU result if low. 


SIOO 


Yes 




Fills a zero in LSB of each word if high; fills a 


SioT 


Yes 




one in LSB if low. 


SI02 


Yes 






SI03 


Yes 






Cn 


No 




Affects arithmetic operation programmed in bits 
I3-I0 of instruction field. 
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Arithmetic Left Single Precision Shift 



SLA 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow or if MSB XOR MSB-1 

C = 1 if carry-out condition 



1 before shift 



2 

> 

O 
H 
00 
00 
CO 
N) 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operatibn. 

EXAMPLE (assumes a 32-bit configuration) 

Perform the computation A = 2(A + B), where A and B are single-precision, two's 
complement numbers. Let A be stored in register 1 and B be input via the DB bus. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 
















Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 




I7-I0 


A5-A0 


B5-B0 


1a ebo 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


sioo 


IESI00 


SSF 


0100 0001 


00 0001 


XX XXXX 


0 10 


00 0001 


0 


0000 


10 X X 


XXXX 


0 


0 


110 


1110 


0000 


1 



Assume register file 1 holds 1308C618 (Hex), DB bus holds 44007530 (Hex). 



Source 0001 0011 oooo 1000 110001100001 1000 R;*- RF(1) 
Source 



0100 0100 0000 0000 01 11 0101 001 1 0000 



S *- DB bus 



Intermediate 
Result 



0101 0111 0000 1001 0011 1011 0100 1000 



ALU Shifter «- R + S + Cn 



Destination 1010 11100001 00100111 0110 1001 0001 RF(1) *- ALU shift result 
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SLAD 



Arithmetic Left Double Precision Shift 1 5 1 1 



FUNCTION 

Performs arithmetic left shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 

DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is used as the upper 
half of a double-precision word, the contents of the MQ register as the lower half. 

The contents of the MQ register are shifted one bit to th e left . A zero is filled into 
bit 0 of the least significant byte of each word unless the SIO input for the word is 
set to zero; this will force bit 0 to one. Bit 7 of the most significant byte in the MQ 
shifter is passed to bit 0 of the least significant byte of the ALU shifter. Bit 7 of the 
most significant byte in the ALU shifter is dropped. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX and MQ register. If SSF is low, the ALU output and MQ 
register will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 



Shift Operations 



ALU Shifter 


MQ Shifter 


Arithmetic Left 


Arithmetic Left 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


Yes 


Passes shift result if high; passes ALU result if low. 


SIOO 


Yes 


Fills a zero in LSB of each word if high; fills a 


SlST 


Yes 


one in LSB if low. 


SI02 


Yes 




siM 


Yes 




Cn 


No 


Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 
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CN 
CO 
00 
00 
h- 

o 
< 



CO 



n 



Arithmetic Left Double Precision Shift 



SLAD 



Status Signals 1 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow or if MSB XOR MSB-1 = 1 before shift 

C = 1 if carry-out condition 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



c/> 

> 

O 
H 
00 
00 
CO 

ro 



EXAMPLE (assumes a 32-bit configuration) 

Perform the computation A = 2(A + B), where A and B are two's complement numbers. 
Let A be a double precision number residing in register 1 (MSH) and the MQ register 
(LSH). Let B be a single precision number which is input through the DB bus. 



instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 
















Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 




I7-I0 


A5-A0 


B5-B0 


Ia ebo 


C5-C0 


SELMQ 


WEO 


SELRFO OLA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIO0 


SSF 


0101 0001 


00 0001 


XX XXXX 


0 10 


00 0001 


0 


0000 


10 X X 


XXXX 


0 


0 


110 


1110 


0000 


1 



Assume register file 1 holds 2408C618 (Hex), DB bus holds 26007530 (Hex), and 
MQ register holds 50A99A0E (Hex). 

MSH 



Source 0010 0100 oooo 1000 1100 0110 0001 1000 R RF(1 



Source 00100110000000000111010100110000 S *- DB bus 



Intermediate 
Result 



0100 10100000 1001 0011 1011 0100 1000 



ALU Shifter *- R + S + Cn 



Destination 
LSH 



1001 0100 0001 00100111 0110 1001 0000 



RF(1) — ALU shift register 



Source 0101 0000 1010 1001 1001 10100000 1110 MQ shifter +- MQ register 



Destination 1010 0001 0101 0011 0011 0100 0001 1101 MQ register MQ shift result 
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SLC 



Circular Left Single Precision Shift 



FUNCTION 

Performs circular left shift on result of ALU operation specified in lower nibble of 
instruction field. 

DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is rotated one bit 
to the left. Bit 7 of the most significant byte in each word is passed to bit 0 of the 
least significant byte in the word, which may be 1, 2, or 4 bytes long. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to Y MUX. If SSF is low, F is passed unaltered. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Circular Left 


None 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high; passes ALU result if low. 


SIO0 


No 




Bit 7 of ALU result 


SI01 


No 




Bit 1 5 of ALU result 


SI02 


No 




Bit 23 of ALU result 


SI03 


No 




Bit 31 of ALU result 


Cn 


No 




Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 



CM 
CO 
00 
00 
h- 

o 
< 

z 
</> 
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Circular left Single Precision Shift 



SIC 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out condition 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



EXAMPLE (assumes a 32-bit configuration) 

Perform a circular left shift of register 6 and store the result in register 1 . 



2 

> 

O 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- SELRF1- 


OEY3- 




CF2- 




I7-I0 


A 5- AO 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO SELRFO OEA OEB 


0EY0 OES 


Cn 


CFO 


SSF 


0110 0110 


00 0110 


XX XXXX 


0 00 


00 0001 


0 


0000 10 XX 


XXXX 0 


0 


110 


1 



00 Assume register file 6 holds 3788C618 (Hex). 
00 

GO r— 

fO Source 0011 0111 1000 1000 110001100001 1000 R *- RF(6) 



Intermediate 
Result 



0011 0111 10001000110001100001 1000 ALU Shifter *- R + Cn 



Destination 0110 1111 0001 0001 1000 11000011 0000 RF(1) *- ALU shifter result 
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SLCD 



Circular Left Double Precision Shift 



FUNCTION 

Performs circular left shift on MQ register (LSH) and result of ALU operation specified 
in lower nibble of instruction field (MSH). 

DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is used as the upper 
half of a double-precision word, the contents of the MQ register as the lower half. 

The contents of the MQ and ALU registers are rotated one bit to the left. Bit 7 of the 
most significant byte in the MQ shifter is passed to bit 0 of the least significant byte 
of the ALU shifter. Bit 7 of the most significant byte is passed to bit 0 of the least 
significant byte in the MQ shifter. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to Y MUX. If SSF is low, F is passed unaltered and the MQ register is 
not changed. 

*A list of ALU operations that can be used with this instruction is given in Table 1 5. 



Shift Operations 



ALU Shifter 


MQ Shifter 


Circular Left 


Circular Left 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high; passes ALU result if low. 


sioo 


No 




Bit 7 of ALU result 


SIOT 


No 




Bit 1 5 of ALU result 


SI02 


No 




Bit 23 of ALU result 


SI03 


No 




Bit 31 of ALU result 


Cn 


No 




Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 
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n 



Circular Left Double Precision Shift 



SLCD 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out condition 



2 

> 

O 
H 
00 
CO 
GO 
N) 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Perform a circular left double precision shift of data in register 6 (MSH) and MQ (LSH), 
and store the result back in register 6 and the MQ register. 



Instr 
Code 
I7-I0 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sel 
EB1- 
EA EBO 


Dest 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB 0EY0 OES 


Gn 


CF2- 
CFO 


SSF 


0111 0110 


00 0110 


XX XXXX 


0 00 


00 0110 


0 0000 10 X X XXXX 0 


0 


110 


1 



Assume register file 6 holds 3708C61 8 (Hex) and MQ register holds 50A99A0E (Hex). 
MSH 



Source 

Intermediate 
Result 

Destination 

LSH 



001 1 01 1 1 0000 1000 1 100 01 10 0001 1000 



R - RF(6) 



001 1 01 1 1 oooo 1000 1 100 01 10 0001 1000 ALU Shifter *~ R + Cn 

RF(6) *- ALU shifter result 



0110 1111 0001 0001 1000 1100 0011 0000 



Source 0101 oooo 1010 1001 1001 10100000 1110 MQ register *- MQ register 



Destination 1010 0001 0101 0011 0011 0100 0001 1100 MQ register +~ MQ shift result 
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SMTC 



Sign Magnitude/Two's Complement 



518 



FUNCTION 

Converts data on the S bus from sign magnitude to two's complement or vice versa. 
DESCRIPTION 

The S bus provides the source word for this instruction. The number is converted by 
inverting S and adding the result to the carry-in, which should be programmed high 
for proper conversion; the sign bit of the result is then inverted. An error condition 
will occur if the source word is a negative zero (negative sign and zero magnitude). 
In this case, SMTC generates a positive zero, and the OVR pin is set high to reflect 
an illegal conversion. 

The sign bit of the selected operand in the most significant byte is tested; if it is high, 
the converted number is passed to the destination. Otherwise the operand is passed 
unaltered. 

Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 






(AS- AO) 


DA-Port 




Immed 


A3-A0 








Mask 


No 


No 


No 


No 



<M 
CO 
00 
00 
H 
O 
< 

CO 



Available S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


Yes 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 
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irnn 



Sigm Magnitude/Two's Complement 



SMTC 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sloo 


No 


Inactive 


sioT 


No 


Inactive 


Si02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


Yes 


Should be programmed high for proper conversion 



Status Signals 



Z 
> 

o 

H 
00 
00 
GO 

M EXAMPLES (assumes a 32-bit configuration) 

Convert the two's complement number in register 1 to sign magnitude representation 
and store the result in register 4. 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 
OVR = 1 if input of most significant byte is 80 (Hex) and results in all other 
bytes are 00 (Hex). 
C = 1 if S = 0 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A 5- AO 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


0101 1000 


XX XXXX 


00 0001 


X 00 


00 0100 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Example 1: Assume register file 1 holds C3F6D840 (Hex). 



Source 1100 0011 1111 0110 1101 100001000000 S *- RF(T) 



Destination 1011 11000000100100100111 11000000 RF(4) «- S' + Cn 



Example 2: Assume register file 1 holds 550927C0 (Hex). 



Source 0101 0101 0000 1001 00100111 11000000 S *- RF(1 



Destination 0101 0101 0000 1001 00100111 11000000 RF(4) *- S 
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SMULI 



Signed Multiply Iterate 



fTTo 



FUNCTION 

Computes one of N-1 signed or N mixed multiplication iterations for computing an 
N-bit by N-bit product. Algorithms for signed and mixed multiplication using this 
instruction are given in the ' 'Other Arithmetic Instructions" section. 

DESCRIPTION 

SMULI checks to determine whether the multiplicand should be added with the present 
partial product. The instruction evaluates: 

F R + S + Cn if the addition is required 
F S if no addition is required 

A double precision right shift is performed. Bit 0 of the least significant byte of the 
ALU shifter is passed to bit 7 of the most significant byte of the MQ shifter; carry-out 
is passed to the most significant bit of the ALU shifter. 

The S bus should be loaded with the contents of an accumulator and the R bus with 
the multiplicand. The Y bus result should be written back to the accumulator after 
each iteration of UMULI. The accumulator should be cleared and the MQ register loaded 
with the multiplier before the first iteration. 

Available R Bus Source Operands 



CM 
CO 
00 
00 
h- 
U 
< 

z 

CO 









C3-C0 


RF 
(A5-AO) 


A3-A0 
Immed 


DA- Port 


A3-A0 
Mask 


Yes 


No 


Yes 


No 


Recommended S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


No 




Recommended Destination Operands S 


RF 
(C5-C0) 


RF 

(B5-B0) 


Y-Port 




Yes 


No 


No 





ALU 


MQ 


Right 


Right 
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ITTT 



Signed Multiply Iterate 



SMULI 



Control/Data Signals 



00 

2 

^4 

> 

O 
H 
00 
00 
Ca) 




Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIO0 


No 


Passes LSB from ALU shifter to MSB of MQ shifter. 


SIOT 


No 




SI02 


No 




SI03 


No 




Cn 


Yes 


Should be programmed low 


Status Signals 


ZERO = 


1 if result = 0 




N = 


1 if MSB = 1 




OVR = 


0 




C = 


1 if carry-out 
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SMULT 



Signed Multiply Terminate 



FUNCTION 

Performs the final iteration for computing an N-bit by N-bit signed product. An algorithm 
for signed multiplication using this instruction is given in the "other Arithmetic 
Instructions" section. 



DESCRIPTION 

SMULI checks the present multiplier bit (the least significant bit of the MQ register) 
to determine whether the multiplicand should be added with the present partial product. 
The instruction evaluates: 

F *- R' + S + Cn if the addition is required 
F S if no addition is required 

with the correct sign in the product. CM 

CO 

A double precision right shift is performed. Bit 0 of the least significant byte of the 00 
ALU shifter is passed to bit 7 of the most significant byte of the MQ shifter. £2 

O 

The S bus should be loaded with the contents of an register file holding the previous ^ 
iteration result; the R bus must be loaded with the multiplicand. After executing SMULT, 
the Y bus contains the most significant half of the product, and MQ contains the least ^ 
significant half. (/) 



Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 


DA-Port 










(A5-A0) 


Immed 




A3-A0 








Mask 


Yes 


No 


Yes 


No 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


No 



ALU 


MQ 


Right 


Right 
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7 0 



Signed Multiply Terminate 



SMULT 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


5ioo 


No 


Passes LSB from ALU shifter to MSB of MQ shifter. 


sIoT 


No 




§102 


No 




SI03 


No 




Cn 


Yes 


Should be programmed low 


Status Signals 


ZERO = 


1 if result = 0 




N = 


1 if MSB = 1 




OVR = 


0 




C = 


1 if carry-out 
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SNORM 



Single-Length Normalize 



2 



0 



FUNCTION 

Tests the two most significant bits of the MQ register. If they are the same, shifts 
the number to the left. 



DESCRIPTION 

This instruction is used to normalize a two's complement number in the MQ register 
by shift ing the number one bit position to the left and filling a zero into the LSB (unless 
the SIO input for that word is low). Data on the S bus is added to the carry, permitting 
the number of shifts performed to be counted and stored in one of the register files. 

The shift and the S bus increment are inhibited whenever normalization is attempted 
on a number already normalized. Normalization is complete when overflow occurs. 



Available R Bus Source Operands 









C3-C0 


RF 


A3-AO 








DA-Port 




(A5-A0) 


Immed 


A3-A0 








Mask 


No 


No 


No 


No 



CO 
00 
00 

I- 
o 
< 

z 



Available S Bus Source Operands (Count) 



RF 
(B5-BO) 


DB-Port 


MQ 
Register 


Yes 


No 


No 



Available Destination Operands Shift Operations 
(Count) (Conditional) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


No 


Left 
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ITTo 



Single-Length Normalize 



SNORM 



Control/Data Signals 





User 




Signal 


Programmable 


Use 


SSF 


No 


Inactive 


siSo 


No 


Passes internally generated end-fill bit. 


SRJT 


No 




SI02 


No 




Sl03 


No 




Cn 


Yes 


Increments S bus (shift count) if set to one. 



Status Signals 



ZERO 
N 

OVR 
C 



1 if result = 0 

1 if MSB of MQ register = 1 

1 if MSB of MQ register XOR 2nd MSB = 1 

1 if carry-out = 1 



z 

> 

o 

H 

EXAMPLE (assumes a 32-bit configuration) 

Cs) 

jnj Normalize the number in the MQ register, storing the number of shifts in register 3. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Oest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Gn 


CFO 


0010 0000 


XX XXXX 


00 001 1 


X 00 


00 001 1 


0 


0000 


10 X X 


XXXX 


0 


1 


110 



Assume register file 3 holds 00000003 (Hex) and MQ register holds 3699D84E (Hex). 
Operand 



Destination 
Count 



Source 0011 0110 100.1 1001 1101 1000 0100 1110 MQ shifter *- MQ register 

MQ register **- MQ shifter 



0110 1101 0011 0011 101.1 0000 1001 1100 



Source 0000 0000 0000 0000 0000 0000 0000 001 1 S «~ RF{3) 



Destination 00000000000000000000000000000100 RF{3) S + Cn 



3-162 



SRA Arithmetic Right Single Precision Shift 



FUNCTION 



Performs arithmetic right shift on result of ALU operation specified in lower nibble of 
instruction field. 

DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is shifted one bit 
to the right. The sign bit of the most significant byte is retained unless it is inverted 
as a result of overflow. Bit 0 of the least significant byte is dropped. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Arithmetic Right 


None 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shifted output if high; passes ALU result 
if low. 


SlOO 


No 




LSB is shifted out from each word, which may be 


SIOT 


No 




1, 2, or 4 bytes long depending on selected 


Si02 


No 




configuration 


§103 


No 






Cn 


No 




Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 




CM 
CO 
00 
00 

o 
< 

Z 
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Arithmetic Right Single Precision Shift 



SRA 



Status Signals* 




ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 0 

C = 1 if carry-out condition 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Perform the computation A = (A + B)/2, where A and B are single-precision numbers. 
CO Let A reside in register 1 and B be input via the DB bus. 

> 

O 
H 
00 

2 Assume register file 1 holds 6A08C618 (Hex) and DB bus holds 51007530 (Hex). 

ro i n 

' n R«-RF(1) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 


Destination Selects 












Code 


Addr 


Addr 


EB1- 


Addr 


WE3- SELRF1- 


0EY3- 






CF2- 




I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SSF 


0000 0001 


00 0001 


XX XXXX 


0 10 


00 0001 


0 0000 10 X X 


XXXX 


0 


0 


110 


1 



Source 01101010 0000 1000 1 100 01 10 0001 1000 



Source 

Intermediate* 
Result 

Destination 



0101 0001 0000 0000 01 1 1 0101 001 1 0000 



1011 1011 0000 1001 0011 1011 0100 1000 



0101 1101 100001001001 1101 10100100 



S «- DB bus 

ALU Shifter *- R + S + Cn 
RF(1) «- ALU shift result 



*After the intermediate operation (ADD), overflow has occurred and OVR status signal is set high. When the 
arithmetic right shift is executed, the sign bit is corrected (see Table 16 for shift definition notes). 
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SRAD 



Arithmetic Right Double Precision Shift 



FUNCTION 

Performs arithmetic right shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 



DESCRIPTION 

The result of the ALU operation specif ied in instruction bits 13-10 is used as the upper 
half of a double precision word, the contents of the MQ register as the lower half. 

The contents of the ALU are shifted one bit to the right. The sign bit of the most 
significant byte is retained unless the sign bit is inverted as a result of overflow. Bit 0 
of the least significant byte in the ALU shifter is passed to bit 7 of the most significant 
byte of the MQ register. Bit 0 of the MQ register's least significant byte is dropped. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Arithmetic Right 


Arithmetic Right 



CM 
CO 
00 
00 
h- 

o 
< 

2 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shifted output if high; passes ALU result 
if low. 


SIOO 


No 




LSB of ALU shifter is passed to MSB of MQ shifter, 


SioT 


No 




and LSB of MQ shifter is dropped. 


SI02 


No 






SI03 


No 






Cn 


No 




Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 
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Arithmetic Right Double Precision Shift 



SRAD 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 0 

C = 1 if carry-out condition 




C/> 

z 

> 

o 

H 
00 
00 
GO 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Perform the computation A = (A + B)/2, where A and B are two's complement numbers. 
Let A be a double precision number residing in register 1 (MSH) and MQ (LSH). Let 
B be a single precision number which is input through the DB bus. 



Instr 
Code 
I7-I0 


Oprd 
Addr 
A5-A0 


Oprd 
Addr 
B5-B0 


Oprd Sel 
EB1- 
EA EBO 


□est 
Addr 
C5-C0 


Destination Selects 
WE3- SELRF1- OEY3- 
SELMQ WEO SELRFO OEA OEB OEYO OES 


Cn 


CF2- 
CFO 


SSF 


0001 0001 


00 0001 


XX XXXX 


0 10 


00 0001 


0 0000 10 X X XXXX 0 


0 


110 


1 



Assume register file 1 holds 4A08C618 (Hex), and DB bus holds 51007530 (Hex), 
and MQ register holds 17299A0F (Hex). 

MSH 



Source 0100 10100000 1000 110001100001 1000 R *- RF(1 



Source 01010001000000000111010100110000 S *- DB bus 



Intermediate* 
Result 



1001 1011 0000 1001 0011 1011 0100 1000 



ALU Shifter ■«*- R + S + Cn 



Destination 



LSH 



Destination 



01001101 100001001001 1101 10100100 



RFM ) «- ALU shift result 



Source 0001 0111 0010 1001 1001 10100000 1111 MQ shifter *~ MQ register 

MQ register MQ shift result 



0000 1011 1001 0100 1100 1101 00000111 



* After the intermediate operation (ADD), overflow has occurred and OVR status signal is set high. When the 
arithmetic right shift is executed, the sign bit is corrected (see Table 16 for shift definition notes). 
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SRC 



Circular Right Single Precision Shift 



FUNCTION 

Performs circular right shift on result of ALU operation specified in lower nibble of 
instruction field. 



DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is shifted one bit 
to the right. Bit 0 of the least significant byte is passed to bit 7 of the most significant 
byte in the same word, which may be 1 , 2, or 4 bytes long depending on the selected 
configuration. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Circular Right 


None 



Available Destination Operands (ALU Shifter) 



CM 
CO 
00 
00 

O 
< 

2 



RF 

(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high; passes ALU result 
if low. 


SIO0 


No 




Rotates LSB to MSB of the same word, which may 


sioT 


No 




be 1, 2, or 4 bytes long depending on configuration 


SI02 


No 






SI03 


No 






Cn 


No 




Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 
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Circular Right Single Precision Shift SRC 



Status Signals t 




C/) 
2 

> 

O 



ZERO =1 if result = 0 

N =1 if MSB of result = 1 
= 0 if MSB of result = 0 
OVR = 1 if signed arithmetic overflow 
C = 1 if carry-out condition 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Perform a circular right shift of register 6 and store the result in register 1. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




Destination Selects 












Code 


Addr 


Addr 


EB1- 


Addr 




WE3- SELRF1- 


0EY3- 






CF2- 




I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO SELRFO OEA OEB 


0EY0 


OES 


Cn 


CFO 


SSF 


1000 0110 


00 0110 


XX XXXX 


0 XX 


00 0001 


0 


0000 10 X X 


XXXX 


0 


0 


110 


1 



00 Assume register file 6 holds 3788C618 (Hex). 
00 

CJ I ; : •■ ; ; 

|sj Source 0011 0111 1000 1000 110001100001 1000 



R - RF(6) 



Intermediate 
Result 



0011 0111 1000 1000 110001100001 1000 ALU Shifter *- R + Cn 



Destination 0001 1011 1100010001100011 0000 1100 RF(1) «- ALU shift result 
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SRCD 



Circular Right Double Precision Shift 



FUNCTION 

Performs circular right shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 

DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is used as the upper 
half of a double precision word, the contents of the MQ register as the lower half. 

The contents of the ALU and MQ shifters are rotated one bit to the right. Bit 0 of the 
least significant byte in the ALU shifter is passed to bit 7 of the most significant byte 
of the MQ shifter. Bit 0 of the least significant byte is passed to bit 7 of the most 
significant byte of the ALU shifter. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MXU and MQ register. If SSF is low, the Y MUX and MQ register 
will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 1 5. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Circular Right 


Circular Right 



CN 
00 
00 
00 

O 
< 

2 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


Yes 


Passes shift result if high; passes ALU result and 
retains MQ register if low. 


SIOO 


No 


Rotates LSB of ALU shifter to MSB of MQ shifter, 


SI01 


No 


and LSB of MQ shifter to MSB of ALU shifter 


SI02 


No 




SI03 


No 




Cn 


No 


Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 
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Circular Right Double Precision Shift 



SRCD 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out condition 



CO 

z 

> 

o 

H 
00 
00 
(a) 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Perform a circular right double precision shift of the data in register 6 (MSH) and MQ 
(LSH), and store the result back in register 6 and the MQ register. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




WE3- 


Destination Selects 










Code 


• Addr 


Addr 


EB1- 


Addr 




SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-CO 


SELMQ 


wio 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1001 0110 


00 0110 


XX XXXX 


0 XX 


00 01 10 


0 


0000 


10 X X 


XXXX 


0 


0 


110 



Assume register file 6 holds 3788C61 8 (Hex) and MQ register holds 50A99A0F (Hex). 
MSH 



Source 

Intermediate 
Result 

Destination 



0011 0111 0000 1000 1100 0110 0001 1000 



R - RF(6) 



0011 0111 oooo 1000 1100 0110 0001 1000 ALU shifter *- R + Cn 

RF(6) «- ALU shift result 



1001 1011 1000 0100 0110 0011 0000 1100 



LSH 



Source 0101 oooo 1010 1001 1001 1010 oooo 1 1 1 1 MQ shifter MQ register 



Destination 0010 1000 0101 0100 11001101 oooo 0111 MQ register *- MQ shift result 
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SRL 



Logical Right Single Precision Shift 



FUNCTION 

Performs logical right shift on result of ALU operation specified in lower nibble of 
instruction field. 



DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is shifted one bit 
to the right. A zero is placed in the bit 7 of the most significant byte of each word 
unless the SIO input for the word is programmed low; this will force the sign bit to 
one. The LSB is dropped from the word, which may be 1 , 2, or 4 bytes long depending 
on selected configuration. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Logical Right 


None 



CM 
CO 
00 
00 
h- 

o 
< 

z 



Available Destination Operands (ALU Shifter) 



RF 
(C5-CO) 


RF 
(B5-BO) 


Y-Port 


Yes 


No 


Yes 



Control/Data Signals * 



Signal 


User 
Programmable 


Use 


SSF 




Passes shift result if high or floating; passes ALU 
result if low. 


SIO0 


Yes 


Fills a zero in MSB of the word if high or floating; 


SIOT 


Yes 


fills a one in MSB if low. 


ST02 


Yes 




Sl03 


Yes 




Cn 




Inactive 



*Cn is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 
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logical Right Single Precision Shift 



SRL 



EXAMPLE (assumes a 32-bit configuration) 

Perform a logical right single precision shift on data on the DA bus, and store the result 
in register 1 . 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 
















Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


0EY3- 






CF2- 


SI03- 


IESI03- 




I7-IO 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO oIa OEB 


OEYO 


OES 


Cn 


CFO 


sloo 


IESIO0 


SSF 


0010 0110 


XX XXXX 


XX XXXX 


1 XX 


00 0001 


0 


0000 


10 X X 


XXXX 


0 


0 


110 


XXX1 


0000 


1 




Assume DA bus holds 2DA8C615. 
Source 



CO 
2 

> 

O 
H 
00 
00 
CO 



0010 1 101 1010 1000 1 100 01 10 0001 0101 



Intermediate 
Result 

Destination 



0010 1101 1010 1000 1100 0110 0001 0101 



0001 0110 1.101 0100 0110 0011 0000 1010 



R *- DA bus 



ALU Shifter - R + Cn 



RF(1) «- ALU shift result 
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SRLD 



Logical Right Double Precision Shift 



FUNCTION 

Performs logical right shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 



DESCRIPTION 

The result of the ALU operation specified in instruction bits 13-10 is used as the upper 
half of a double precision word, the contents of the MQ register as the lower half. 

The ALU result is shifted one bi t to the right. A zero is placed in the sign bit of the 
most significant byte unless the SIO input for that word is programmed low; this will 
force the sign bit to one. Bit 0 of the least significant byte is passed to bit 7 of the 
most significant byte of the MQ shifter. Bit 0 of the least significant byte of the MQ 
shifter is dropped. 

The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX and MQ register. If SSF is low, the ALU result and MQ 
register will not be altered. 

*A list of ALU operations that can be used with this instruction is given in Table 15. 
Shift Operations 



ALU Shifter 


MQ Shifter 


Logical Right 


Logical Right 



CM 
CO 
00 
00 

h- 
o 
< 

z 



Available Destination Operands (ALU Shifter) 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 




Yes 


No 


Yes 




Control/Data Signals 




Signal 


User 
Programmable 


Use 


SSF 


Yes 




Passes shift result if high; passes ALU result and 
retains MQ 


SIO0 


Yes 




Fills a zero in MSB if high or floating; 


SIOT 


Yes 




fills a one MSB if low. 


SI02 


Yes 






SlS3 


Yes 






Cn 


No 




Affects arithmetic operation specified in bits I3-I0 of 
instruction field. 
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Logical Right Double Precision Shift 



SRLD 



Status Signals t 



ZERO = 1 if result = 0 

N = 1 if MSB of result = 1 

= 0 if MSB of result = 0 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out condition 




2 
> 

a 

H 
CO 
00 
GO 
M 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Perform a logical right double precision shift of the data in register 1 (MSH) and MQ 
(LSH), filling a one into the most significant bit, and store the result back in register 1 
and the MQ register. 



Instr • 


Oprd 


Oprd 


Oprd Sel 


Dest 


Destination Selects 














Code 


Addr 


Addr 


EB1- 


Addr 


WE3- SELRF1- 


0EY3- 






CF2- 


SI03- 


IESI03- 


I7-I0 


A 5- AO 


B5-B0 


EA EB0 


C5-C0 


SELMQ WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIOO 


0011 0110 


XX XXXX 


00 0001 


X 00 


00 0001 


0 0000 10 X X 


XXXX 


0 


0 


no 


1110 


0000 



Assume register file 1 holds 2DA8C61 5 (Hex) and MQ register holds 50A99A0E (Hex). 
MSH 



Source 

Intermediate 
Result 

Destination 



0010 1101 1010 1000 1100 0110 0001 0101 



R *- RF(1) 



0010 1101 1010 1000 110001100001 0101 ALU Shifter *- S + Cn 



1001 0110 1101 0100 0110 0011 0000 1010 



LSH 



Source 



0101 0000 1010 1001 1001 1010 0000 1110 



Destination 



1010 1000 0101 0100 1100 1101 0000 0111 



RF(1) - ALU shift result 

MQ shifter MQ register 
MQ register *- MQ shift result 
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SUBI 



Subtract Immediate 



7 8 



FUNCTION 

Subtracts four-bit immediate data on A3-A0 with carry from S-bus data. 
DESCRIPTION 

Immediate data in the range 0 to 1 5, supplied by the user at A3-A0, is inverted and 
added with carry to S. 



Available R Bus Source Operands (Constant) 









C3-C0 


RF 

(A5-A0) 


A3-A0 
I mimed 


DA-Port 


A3-A0 
Mask 


No 


Yes 


No 


No 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CO 
00 
00 
h- 
O 
< 

z 



Available Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 

(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


None 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sioo 


No 


Inactive 


SIOT 


No 


Inactive 


SI02 


No 


Inactive 


SI03 


No 


Inactive 


Cn 


Yes 


Two's complement subtraction if programmed high. 
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an 



Subtract Immediate 



SUBI 



Status Signals 



ZERO = 1 if result = 0 

N = 1 if MSB =1 

OVR = 1 if arithmetic signed overflow 

C = 1 if carry-out 



EXAMPLE (assumes a 32-bit configuration) 

Subtract the value 1 2 from data on the DB bus, and store the result into register file 1 . 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 


Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 


WE3- SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


0111 1000 


00 1100 


XX XXXX 


X 10 


00 0001 


0 0000 10 X X 


XXXX 


0 


1 


110 



c/> 
z 

o 

H 
CO 
00 
Ca> 



Assume bits A3-A0 hold C (Hex) and DB bus holds 24000100 (Hex). 
Source oooo oooo oooo oooo oooo oooo oooo 1 1 oo R A3- AO 



Source 00100100 oooo oooo oooo 0001 oooo oooo 



DB bus 



Destination 0010 010000000000 0000 000011110100 RF(1) R' + S + Cn 
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SUBR 



Subtract R with Carry (R' + S 4- Cn) 



FUNCTION 

Subtracts data on the R bus from S with carry. 
DESCRIPTION 

Data on the R bus is subtracted with carry from data on the S bus. The result appears 
at the ALU and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 1 5. 



Available R Bus Source Operands 



RF 
(A5-AO) 


A3-A0 
Immed 


DA-Port 


C3-C0 

A3-A0 
Mask 






Yes 


No 


Yes 


No 


Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 


Available Destination Operands 


RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 




Yes 


No 


Yes 


Yes 


Yes 


Control/Data Signals 








Signal 


User 
Programmable 


Use 


SSF 


No 




Affect shift instructions programmed in bits I7-I4 of 


sloo 


No 




instruction field. 




SIOT 


No 










Si02 


No 










SI03 


No 










Cn 


Yes 




Two's complement subtraction if programmed high. 



CN 
00 
00 
00 
H 
O 
< 

z 
c/> 
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Subtract R with Carry (R' + S + Cn) 



SUBR 



Status Signals* 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C = 1 if carry-out 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated after shift 
operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 




EXAMPLE (assumes a 32-bit configuration) 

Subtract data in register 1 from data on the DB bus, and store the result in the MQ 
' register. 

z 

> 

O 

00 Assume register file 1 holds 150084D0 (Hex) and DB bus holds 4900C350 (Hex). 
00 

00 | — : 1 

jsj Source 0001 0101 oooooooo 10000100 1101 0000 R «- RF(1) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 




Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 


WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1110 0010 


00 0001 


XX XXXX 


0 10 


XX XXXX 


1 XXXX 


XX X X 


XXXX 


0 


1 


110 



Source 0100 1001 0000 0000 1 100 001 1 0101 0000 



DB bus 



Destination 001 1 0100 0000 0000 001 1 1110 1000 0000 MQ register *- R' + S + Cn 
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SUBS 



Subtract S with Carry (R + S' + Cn) 



n 



FUNCTION 

Subtracts data on the S bus from R with carry. 
DESCRIPTION 

Data on the S bus is subtracted with carry from data on the R bus. The result appears 
at the ALU and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14} of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 1 5. 



Available R Bus Source Operands 









C3-CO 




RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 




Yes 


No 


Yes 


No 




Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 



CO 
00 
00 

h- 
o 
< 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Affect shift instructions programmed in bits I7-I4 of 


sioo 


No 


instruction field. 


SI01 


No 




SI02 


No 




SI03 


No 




Cn 


Yes 


Two's complement subtraction if programmed high. 
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Subtract S with Carry (R + S' + Cn) 



SUBS 



Status Signals * 



ZERO = 1 if result = 0 

N = 1 if MSB = 1 

OVR = 1 if signed arithmetic overflow 

C =1 if carry-out 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 



EXAMPLE (assumes a 32-bit configuration) 

Subtract data on the DB bus from data in register 1 , and store the result in the MQ 
register. 

C/> 
2 

> 

O 

5q Assume register file 1 holds 150084D0 (Hex) and DB bus holds 4900C350 (Hex). 
00 

(a) f- — : 1 

Source 0001 01.01 0000 0000 10000100 1101 0000 R *- RF(1) 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3- 






CF2- 


I7-I0 


A5-A0 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1110 0011 


00 0001 


XX XXXX 


0 10 


XX XXXX 


1 


XXXX 


XX X X 


XXXX 


0 


1 


110 



Source 0100 1001 0000 0000 1100 0011 0101 0000 



DB bus 



Destination 1100 1011 1111 1111 11000001 10000000 MQ register *- R + S' + Cn 
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TBO 



Test Bit (Zero) 



3 8 



FUNCTION 

Tests bits in selected bytes of S-bus data for zeros using mask in C3-C0: : A3-AO. 
DESCRIPTION 

The S bus is the source word for this instruction. The source word is passed to the 
ALU, where it is compared to an 8-bit mask, consisting of a concatenation of the C3-C0 
and A3-A0 address ports (C3-C0::A3-A0). The mask is input via the R bus. The test 
will pass if the selected byte has zeros at all bit lo catio ns specified by the ones of 
the mask. Bytes are selected by programming the SIO inputs low. Test results are 
indicated on the ZERO output, which goes to one if the test passes. Register write 
is internally disabled during this instruction. 



Available R Bus Source Operands 









C3-C0 


RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


Yes 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
CO 
00 
00 
h- 
O 
< 

z 



Control/Data Signals 



Signal 


User 
Programmable 




SSF 


No 


Inactive 


sISo 


Yes 


Byte Select 


SlST 


Yes 


Byte Select 


SI02 


Yes 


Byte Select 


SI03 


Yes 


Byte Select 


Cn 


No 


Inactive 



Use 
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3 8 



Test Bit (Zero) 



TBO 



Status Signals 



ZERO 


= 1 


N 


= 0 


OVR 


= o 


C 





EXAMPLE (assumes a 32-bit configuration) 

Test bits 7, 6 and 5 of bytes 0 and 2 of data in register 3 for zeroes. 




z 

> 
o 

H 
00 
00 
CO 

to 



Instr 


Mask 


Oprd 


Oprd Sei 


Mask 




Destination Selects 














Code 


(LSH) 


Addr 


EB1: 


(MSH) 




WE3- SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 


I7-IO 


A3-A0 


B5-B0 


EA EBO 


C3-C0 


SELMQ 


WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIO0 


0011 1000 


0000 


00 0011 


X 00 


1110 


X 


XXXX XX X X 


XXXX 


0 


X 


110 


1010 


0000 



Assume register file 3 holds 881CD003 (Hex). 



Source 1 1 10 00001 1 10 0000 1 1 10 0000 1 1 10 0000 R Mask (C3-C0::A3-A0) 
Source 



1000 1000 0001 1 100 11 01 0000 0000 001 1 I SN *- RF(3)n T 



= nth byte 



Output Q] ZERO — 1 
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TBI 



Test Bit (One) 



2 8 



FUNCTION 

Tests bits in selected bytes of S-bus data for ones using mask in C3-CO::A3-AO. 
DESCRIPTION 

The S bus is the source word for this instruction. The source word is passed to the 
ALU, where it is compared to an 8-bit mask, consisting of a concatenation of the C3-C0 
and A3-A0 address ports (C3-C0::A3-A0). The mask is input via the R bus. The test 
will pass if the selected byte has ones at all bit locations specified by the ones of the 
mask. Bytes are selected by programming the SIO inputs low. Test results are indicated 
on the ZERO output, which goes to one if the test passes. Register write is internally 
disabled for this instruction. 



Available R Bus Source Operands 









C3-C0 


RF 

(A5-A0) 


A3-A0 
Immed 


DA-Port 


A3-A0 
Mask 


No 


No 


No 


Yes 


Available S Bus Source Operands 


RF 
(B5-B0) 


DB-Port 


MQ 
Register 




Yes 


Yes 


Yes 





CM 
CO 
00 
00 
h- 

o 
< 

z 



Control/Data Signals 



Signal 



User 
Programmable 



Use 



SSF 

SIO0 

SIOT 

SI02 

SI03 

Cn 



No 

Yes 

Yes 

Yes 

Yes 

No 



Inactive 
Byte Select 
Byte Select 
Byte Select 
Byte Select 
Inactive 
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2 8 



Test Bit (One) 



TBI 



Status Signals 



ZERO =1 if result (selected bytes) = Pass 

N = 0 

OVR =0 

C = 0 



EXAMPLE (assumes a 32-bit configuration) 

Test bits 7, 6 and 5 of bytes 1 and 2 of data in register 3 for ones. 



Instr 


Mask 


Oprd 


Oprd Set 


Mask 




Destination Selects 














Code 


<LSH) 


Addr 


EB1- 


(MSH) 




WE3- SELRF1- 


OEY3- 






CF2- 


SI03- 


IESI03- 


I7-I0 


A3-A0 


B5-B0 


EA EBO 


C3-C0 


SELMQ 


WEO SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


SIOO 


IESIO0 


0010 1000 


0000 


00 001 1 


X 00 


1110 


X 


XXXX XX X X 


XXXX 


0 


X 


110 


1001 


0000 



2 

> 

O 
— I 
00 
00 
Ca) 
IS) 



Assume register file 3 holds 881CF003 (Hex). 



Mask 11100000111000001110 000011100000 Rn *- Mask (C3-C0::A3-A0) 

Sn RF(3)n T 



Source 



1000 1000 0001 1100 1101 0000 0000 0011 



T n = nth byte 



Output [T| ZERO 0 
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UDIVI 



Unsigned Divide Iterate 



C 



0 



FUNCTION 

Performs one of N-2 iterations of nonrestoring unsigned division by a test subtraction 
of the N-bit divisor from the 2N-bit dividend. An algorithm using this instruction can 
be found in the "Other Arithmetic Instructions" section. 



DESCRIPTION 

UDIVI performs a test subtraction of the divisor from the dividend to generate a quotient 
bit. The test subtraction may pass or fail and is corrected in the subsequent instruction 
if it fails. Similarly a failed test from the previous instruction is corrected during 
evaluation of the current UDIVI instruction (see the "Other Arithmetic 
lnstructions"section for more details). 

The R bus must be loaded with the divisor, the S bus with the most significant half 

of the result of the previous instruction (UDIVI during iteration or UDIVIS at the £j 

beginning of iteration). The least significant half of the previous result is in the MQ CO 

register. 00 

p- 

UDIVI checks the result of the previous pass/fail test and then evaluates: ^ 



F *- R + S if the test is failed 

F R' + S + Cn if the test is passed 

A double precision left shift is performed; bit 7 of the most significant byte of the 
MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 
of the most significant byte of the ALU shifter is lost. The unfixed quotient bit is 
circulated into the least significant bit of the MQ shifter. 



CO 



Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 










DA-Port 




(A5-A0) 


Immed 


A3-A0 








Mask 


Yes 


No 


Yes 


No 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 
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C 0 



Unsigned Divide iterate 



UDIVI 



Recommended Destination Operands Shift Operations 



RF 
(C5-CO) 


RF 
(B5-BO) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Left 


Left 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


5100 


No 


Passes internally generated end-fill bit. 


SIOT 


No 




5102 


No 




5T03 


No 




Cn 


Yes 


Should be programmed high. 



c/> 
z 

> 

o 

H 

00 
00 
Ca> 



Status Signals 



ZERO 
N 

OVR 
C 



1 if result = C 

0 

0 

1 if carry-out 
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UDIVIS Unsigned Divide Start 1 B 1 0 



FUNCTION 

Computes the first quotient bit of nonrestoring unsigned division. An 
algorithm using this instruction is given in the "Other Arithmetic Instructions" section. 



DESCRIPTION 

UDIVIS computes the first quotient bit during nonrestoring unsigned division by 
subtracting the divisor from the dividend. The resulting remainder due to subtraction 
may be negative; the subsequent UDIVI instruction may have to restore the remainder 
during the next operation. 

The R bus must be loaded with the divisor and the S bus with the most significant 
half of the remainder. The result on the Y bus should be loaded back into the register 
file for use in the next instruction. The least significant half of the remainder is in the 
MQ register. 

UDIVIS computes: 

F *- R' + S + Cn 

A double precision left shift is performed; bit 7 of the most significant byte of the 
MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 
of the most significant byte of the ALU shifter is lost. The unfixed quotient bit is 
circulated into the least significant bit of the MQ shifter. 



Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 










DA-Port 




(A5-A0) 


Immed 


A3-A0 








Mask 


Yes 


No 


Yes 


No 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Left 


Left 
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B 0 



Unsigned Divide Start 



UDIVIS 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


sioo 


No 


Passes internally generated end-fill bit. 


SIOT 


No 




SI02 


No 




Sl03 


No 




Cn 


Yes 


Should be programmed high. 



Status Signals 



CO 
Z 

> 

o 

H 
00 
00 
CO 



ZERO = 1 if intermediate result 

N = 0 

OVR = 1 if divide overflow 

C = 1 if carry-out 



3-188 



UDIVIT 



Unsigned Divide Terminate 



F 0 



FUNCTION 

Solves the final quotient bit during nonrestoring unsigned division. An algorithm using 
this instruction is given in the /'Other Arithmetic Instructions" section. 



DESCRIPTION 

UDIVIT performs the final subtraction of the divisor from the remainder during 
nonrestoring signed division. UDIVIT is preceded by N-1 iterations of UDIVI, where 
N is the number of bits in the dividend. 

The R bus must be loaded with the divisor, the S bus must be loaded with the most 
significant half of the result of the last UDIVI instruction. The least significant half 
lies in the MQ register. The Y bus result must be loaded back into the register file for 
use in the subsequent DIVRF instruction. 

UDIVIT checks the results of the previous pass/fail test and evaluates: 



Y «- r + s 

Y - R' + S + Cn 



if the test is failed 
if the test is passed 



The contents of the MQ register are shifted one bit to the left; the unfixed quotient 
bit is circulated into the least significant bit. 

Available R Bus Source Operands 









C3-C0 


RF 


A3-A0 










DA-Port 




(A5-A0) 


Immed 


A3-A0 








Mask 


Yes 


No 


Yes 


No 



CM 
CO 
00 
00 
H 
U 
< 

Z 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


None 


Left 
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FlO i Unsigned Divide Terminate UDIVIT 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Inactive 


SIOO 


No 


Passes internally generated end-fill bit. 


SIOT 


No 




SI02 


No 




SI03 


No 




Cn 


Yes 


Should be programmed high. 



Status Signals 

(/) ZERO = 1 if intermediate result = 0 

2 N = 0 

OVR =0 

^ C - 1 if carry-out 

O 1 : 

H ■ 
00 
00 
CJ 
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UMULI 



Unsigned Multiply Iterate 



D 0 



FUNCTION 

Performs one of N unsigned multiplication iterations for computing an N-bit by N-bit 
product. An algorithm for unsigned multiplication using this instruction is given in the 
"Other Arithmetic Instructions" section. 



DESCRIPTION 

UMULI checks to determine whether the multiplicand should be added with the present 
partial product. The instruction evaluates: 

F *- R + S + Gn if the addition is required 

F <*- S if no addition is required 

A double precision right shift is performed. Bit 0 of the least significant byte of the 
ALU shifter is passed to bit 7 of the most significant byte of the MQ shifter; carry-out 
is passed to the most significant bit of the ALU shifter. 

The S bus should be loaded with the contents of an accumulator and the R bus with 
the multiplicand. The Y bus result should be written back to the accumulator after 
each iteration of UMULI. The accumulator should be cleared and the MQ register loaded 
with the multiplier before the first iteration. 

R Bus Source Operands 



CNJ 
CO 
00 
00 
h- 
O 
< 

z 

CO 



RF 
(A5-A0) 


A3-A0 
Immed 


DA-Port 


C3-C0 

A3-A0 
Mask 


Yes 


No 


Yes 


No 



Recommended S Bus Source Operands 



RF 
(B5-B0) 


DB-Port 


MQ 
Register 


Yes 


Yes 


No 



Recommended Destination Operands Shift Operations 



RF 
(C5-C0) 


RF 
(B5-B0) 


Y-Port 


Yes 


No 


Yes 



ALU 


MQ 


Right 


Right 
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D 0 



Unsigned Multiply Iterate 



UMULI 



Control/Data Signals 



Signal 


User 
Programmable 


Use 


SSF 


No 


Holds LSB of MQ. 


sioo 


No 


Passes internal input (shifted bit). 


SI01 


No 




SI02 


No 




SI03 


No 




Cn 


Yes 


Should be programmed high. 


Status Signals t 


ZERO = 


1 if result = 0 




N = 


1 if MSB = 1 




OVR = 


0 




C = 


1 if carry-out 





CO 

z 

> 

O 
H 
00 
00 
CO 

ro 



t\/alid only on final execution of multiply iteration 
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XOR Exclusive OR (UXORS) 



FUNCTION 

Evaluates the logical expression R XOR S. 
DESCRIPTION 

Data on the R bus is exclusive ORed with data on the S bus. The result appears at 
the ALU and MQ shifters. 

*The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14) of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 



Available R Bus Source Operands 









C3-C0 




RF 
(A5-A0) 


A3-A0 
Immed 


DA- Port 


A3-A0 
Mask 




Yes 


No 


Yes 


No 




Available S Bus Source Operands 




RF 
(B5-B0) 


DB-Port 


MQ 
Register 






Yes 


Yes 


Yes 






Available Destination Operands 




RF 
(C5-CO) 


RF 
(B5-B0) 


Y-Port 


ALU 
Shifter 


MQ 
Shifter 


Yes 


No 


Yes 


Yes 


Yes 



CM 
CO 
00 
00 

h- 
o 
< 

l> 

C/) 



Control/Data Signals 



Signal 



User 
Programmable 



Use 



SSF 

SIOO 

SloT 

ST02 

ST03 

Cn 



No 
No 
No 
No 
No 
No 



Affect shift instructions programmed in bits I7-I4 of 
instruction field. 



Inactive 
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Exclusive OR (R XOR S) 



XOR 



Status Signals T 



ZERO 
N 

OVR 
C 



1 if result 
1 if MSB = 
0 
0 



*C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 

EXAMPLE (assumes a 32-bit configuration) 

Exclusive OR the contents of register 3 and register 5, and store the result in register 5. 



Instr 


Oprd 


Oprd 


Oprd Sel 


Dest 






Destination Selects 










Code 


Addr 


Addr 


EB1- 


Addr 




WE3- 


SELRF1- 


OEY3 






CF2- 


I7-IO 


A 5- AO 


B5-B0 


EA EBO 


C5-C0 


SELMQ 


WEO 


SELRFO OEA OEB 


OEYO 


OES 


Cn 


CFO 


1111 1001 


00 001 1 


00 0101 


0 00 


00 0101 


0 


0000 


10 X X 


XXXX 


0 


X 


110 



2 
> 

O Assume register file 3 holds 33F6D840 (Hex) and register file 5 holds 90F6D842 (Hex).. 

Source 



00 
00 
CO 



0011 0011 1111 0110 1101 1000 0100 0000 



Source 



1001 0000 1111 0110 1101 1000 0100 0010 



R - RF(3) 
S «- RF(5) 



Destination 1010 001 1 oooo oooo oooo oooo oooo 0010 RF(5) R XOR S 
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1 



SN74ACT8818 16-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 



SN74ACT8836 32- x 32-Bit Parallel Multiplier 




SN74ACT8837 64-Bit Floating Point Processor 



SN74ACT8841 Digital Crossbar Switch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 




'!'" ., ,v", 

Support 



Mechanical Data 



4-1 



4 



C/> 

z 

> 

O 
H 
00 
00 

0s) 
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SN74ACT8836 32 Bit by 32-Bit 
Multiplier/Accumulator 



The SN74ACT8836 is a 32-bit integer multiplier/accumulator (MAC) that accepts 
two 32-bit inputs and computes a 64-bit product. An on-board adder is provided 
to add or subtract the product or the complement of the product from the 
accumulator. 

To speed-up calculations, many modern systems off-load frequently-performed 
multiply/accumulate operations to a dedicated single-cycle MAC. In such an 
arrangement, the 'ACT8836 MAC can accelerate 32-bit microprocessors, 



building block processors, or custom CPUs. The 'ACT8836 is well-suited for 
digital signal processing applications, including fast fourier transforms, digital 
filtering, power series expansion, and correlation. 




CD 
O 
00 
00 
H 
O 
< 

z 
in 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



D3046, JANUARY 1988 



• Multiplier, Multiplicand, and Product Can be 
Complemented 

• Accumulator Bypass Option 

• TTL I/O Voltage Compatibility 

• Three Independent 32-Bit Buses for 
Multiplicand, Multiplier, and Product 

• Parity Generation/Checking 

• Master/Slave Fault Detection 

• Single 5- V Power Supply 

• Integer or Fractional Rounding 



Performs Full 32-Bit by 32-Bit 
Multiply/Accumulate in Flow-Through Mode 
in 60 ns (Max) 

• Can be Pipelined for 36 ns (Max) Operation 

Performs 64-Bit by 64-Bit Multiplication in 
Five Cycles 

• Supports Division Using Newton-Raphson 
Approximation 

• Signed, Unsigned, or Mixed-Mode Multiply 
Operations 

• EPIC ™ (Enhanced-Performance Implanted 
CMOS) 1-/*m Process 

description 

The 'ACT8836 is a 32-bit by 32-bit parallel multiplier/accumulator suitable for low-power, high-speed 
operations in applications such as digital signal processing, array processing, and numeric data processing. 
High speed is achieved through the use of a Booth and Wallace Tree architecture. 

Data is input to the chip through two registered 32-bit DA and DB input ports and output through a registered 
32-bit Y output port. These registers have independent clock enable signals and can be made transparent 
for flowthrough operations. 

The device can perform two's complement, unsigned, and mixed-data arithmetic. It can also operate as 
a 64-bit by 64-bit multiplier. Five clock cycles are required to perform a 64 bit by 64-bit multiplication 
and multiplex the 128-bit result. Division is supported using Newton-Raphson approximation. 

A multiply/accumulate mode is provided to add or subtract the accumulator from the product or the 
complement of the product. The accumulator is 67 bits wide to accommodate possible overflow. A warning 
flag (ETPERR) indicates whether overflow has occurred. 

A rounding feature in the 'ACT8836 allows the result to be truncated or rounded to the nearest 32-bits. 
To ensure data integrity, byte parity checking is provided at the input ports, and a parity generator and 
master/slave error detection comparator are provided at the output port. 

The SN74ACT8836 is characterized for operation from 0°C to 70 0 C. 
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EPIC is a trademark of Texas Instruments Incorporated 
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ADVANCE INFORMATION documents contain 
information on new product in the sampling or 
preproduction phase of development. Characteristic 
data and other specifications are subject to change 
without notice. 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



logic symbol 



CLK ■ 

CKEA • 
CKEB - 
CKEI • 
CKEY - 

DASGN - 
DBSGN • 
COMPL ■ 
RNDO ■ 
RND1 • 
ACCO - 
ACC1 - 

SFTO ■ 
SFT1 - 

FTO - 
FT1 - 

SELY • 
SELD 



(H15) 



SELREG 
WEMS 
WlLS 



DA31 " 
DBO ■ 



DB31 



(H2) 



(G14) 



(C12) 



(F15) 



(G13) 



(H12) 



(G12) 



(E15) 



(C14) 



(D13) 



(F1) 



(G4) 



(H13) 



(G15) 



(B3) 



(GD 



JB22L 
„ (C3) 



(E7) 



(DP 



(G3) 



32 x 32 MULTIPLIER/ 
ACCUMULATOR 
* 

74ACT8S36 



DA REG 
DB REG 
I REG 
Y REG 



CLK 
EN 



PAR 
STAT 



DA PORT 
DB PORT 
Y PORT 
MASTER/SLAVE 
EQUAL CHK 



Y PORT 
PARITY 

I/O 



INSTR 
INPUTS 



0 I SHIFTER 

1 I CONTROL 



0 I FEEDTHROUGH 

1 I CONTROL 



PARITY 
INPUTS 



DA 
PORT 



DB 
PORT 



YMUX 
DMUX 
RMUX 
SMUX 



EXTENDED 
PRECISION 



INPUT 
SELECT 



RA or RB 
MS 32-BITS 
LS 32-BITS 



WRITE 
ENABLE 



3 
0 
2 

Y OUT/EN 



TEST I 0 
PINS I 1 



(08) 


(C15) 


(E14) 


(D15) 






(bi4) 


4_ 




(A15) 


«- 

«- 




(E14) 






(B15) 


4 










(M8) 


+ 




(09) 


4 




(P9> 


4- 




(010) 


«- 




(Q5) 






(M7) 


4 




(Q6) 


«- 




(07) 


4- 










(B13) 


«- 




(B12) 


4- 




(B14) 


4 










(C13) 






(D3) 


4- 

4- 




(D2) 



PYO 
PY1 
PY2 
PY3 

PAO 
PA1 
PA2 
PA3 
PBO 
PB1 
PB2 
PB3 

YETPO 
YETP1 
YETP2 



TPO 
TP1 



c 



4 ► 



4 ► 



• Y31 
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SN74ACT8836 

32 BIT BY 32-BIT MULTIPLIER/ACCUMULATOR 



functional block diagram (positive logic) 



PA3-PA0 
PB3-PB0 >- 



PERRA -+ 
PERRB -4 



DA31 DAO-* 



4- SGNEXT 
4- SELD 



4 — «- SFT1-SFT0 



SELREG 



32 

/ 4 DB31 DBO 




OEY 



ETPERR YETP2 YETPO Y31-YO 



MSERR PY3-PY0 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



GB PIN-GRID-ARRAY PACKAGE 
(TOP VIEW) 



12 3 4 5 6 7 8 9 10 11 12 13 14 15 



A 






B 


•'® . .... 




C 






D 


• • • 


• • • 


• • • 


E 


• • • 




• • • 


F 


• • • 




• # • 


G 


• • • • 




• • • • 


H 


• • • • 




• • • • 


J 


• • • • 




• • • • 


K 


• • • 






L 


• • • 






M 


• • • 


• • • 




N 






P 






R 







GB PACKAGE PIN ASSIGNMENTS 





PIN 




PIN 




PIN 




PIN 




PIN 




PIN 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


A1 


Y8 


B12 


YETP1 


014 


PYO 


H12 


COMPL 


M3 


DB18 


P5 


DB25 


A2 


Y10 


B13 


YETPO 


D15 


ETPERR 


H13 


FTO 


M7 


PB1 


P6 


DB29 


A3 


Y11 


B14 


YETP2 


E1 


SELREG 


H14 


EA 


M8 


PAO 


P7 


DB31 


A4 


Y13 


B15 


PY3 


E2 


Y3 


H15 


CKEA 


M10 


DA6 


P8 


PERRA 


A5 


Y14 


CI 


YO 


E3 


GND 


J1 


DB2 


M13 


DA16 


P9 


PA2 


A6 


Y16 


C2 


Y4 


E13 


GND 


J2 


DB3 


M14 


DA17 


P10 


DA2 


A7 


Y18 


C3 


EB 


E14 


PY2 


J3 


DB5 


M15 


DA25 


P1 1 


DA8 


A8 


Y19 


C4 


Y5 


El 5 


RND1 


J4 


DB7 


N1 


DB10 


P1 2 


DA12 


A9 


Y21 


C5 


vcc 


F1 


SFTO 


J12 


DA26 


N2 


DB19 


P1 3 


DA14 


A10 


Y23 


C6 


GND 


F2 


Y1 


J13 


DA24 


N3 


DB20 


P1 4 


DA1 1 


A1 1 


Y25 


C7 


Y15 


F3 


GND 


J14 


DA30 


N4 


DB21 


P1 5 


DA21 


A12 


Y27 


C8 


GND* 


F13 


GND 


J15 


DA31 


N5 


DB23 


R 1 


DB14 


A13 


Y28 


C9 


Y22 


F14 


MSERR 


K1 


DB4 


N6 


DB27 


R2 


DB26 


A14 


Y30 


C10 


GND 


F15 


DASGN 


K2 


DB9 


N7 


vcc 


R3 


DB28 


A15 


PY1 


C11 


vcc 


G1 


SELD 


K3 


DB11 


N8 


GND 


R4 


DB30 


B1 


Y2 


C12 


CKEY 


G2 


SGNEXT 


K13 


DA22 


N9 


DAO 


R5 


PBO 


B2 


Y6 


C13 


OEY 


G3 


WELS 


K14 


DA28 


N10 


DA4 


R6 


PB2 


B3 


SELY 


C14 


ACCO 


G4 


SFT1 


K15 


DA29 


N1 1 


DA10 


R7 


PB3 


B4 


Y7 


C15 


PERRY 


G12 


RNDO 


L1 


DB6 


N12 


DA13 


R8 


PERRB 


B5 


Y9 


D1 


WEMS 


G13 


DBSGN 


L2 


DB15 


N13 


DA15 


R9 


PA1 


B6 


Y12 


D2 


TP1 


G14 


CKEI 


L3 


DB13 


N14 


DA19 


RIO 


PA3 


B7 


Y17 


D3 


TPO 


G15 


FT1 


L13 


DA 18 


N15 


DA23 


R1 1 


DA1 


B8 


Y20 


D7 


GND 


H1 


CLK 


L14 


DA20 


P1 


DB12 


R1 2 


DA3 


B9 


Y26 


D8 


vcc 


H2 


CKEB 


L15 


DA27 


P2 


DB16 


R1 3 


DAB 


B10 


Y29 


D9 


Y24 


H3 


DBO 


M1 


DB8 


P3 


DB24 


R1 4 


DA7 


B1 1 


Y31 


D13 


ACC1 


H4 


DB1 


M2 


DB17 


P4 


DB22 


R1 5 


DA9 
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SN74ACT8836 

32-BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PIN 

NAME 


NO. 


I/O 


nPC^DiDTirkivi 
Ucov»nll* 1 Ivlnl 


ACCO 
ACC1 


C14 
D13 




Accumul3t6 mode opcode (see Table 2) 


CLK 


H1 


I 


System clock 


CKEA 


H15 




Clock enable for A register, active low 


CKEB 


H2 




Clock enable for B register, active low 


CKEI 


G 1 4 




Clock enable for 1 register, active low 


CKEY 


C12 


' , 


Clock enable for Y register, active low 


COMPL 


H12 


I 


Product complement control; high complements multiplier result, low passes multiplier unaltered 




to accumulator. 


DAO 


N9 






DA1 


R1 1 






DA2 


P10 






DA3 


R12 






DA4 


N10 






DA 5 


R1 3 






DA6 


M10 






DA7 


R14 






DA8 


P1 1 






DA9 


R1 5 






DA10 


N1 1 






DA1 1 


P14 






DA12 


P12 






DA13 


N12 






DA 14 


P13 






DAI 5 


N 1 3 


I 


DA port input data bits 0 through 31 


DA 16 


M13 






DAI 7 


M14 






DA 18 


L13 






DA 19 


N14 






DA20 


L14 






DA21 


PI 5 






DA22 


K13 






DA23 


N1 5 






DA24 


J13 






DA25 


M1 5 






DA26 


J12 






DA27 


L15 






DA28 


K14 






DA29 


K15 






DA30 


J14 






DA31 


J15 






DASGN 


F15 


I 


Sign magnitude control; high identifies DA input data as two's complement, low identifies DA input 




data as unsigned 
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SN74ACT8836 

32 BIT BY 32-BIT MULTIPLIERjACCUMULATOR 



PIN 

NAME 


NO. 


I/O 


DESCRIPTION 


DBO 


H3 






DB1 


H4 






DB2 


J1 






DB3 


J2 






DB4 


K1 






DB5 


J3 






DB6 


L1 






DB7 


J4 






DB8 


M1 






DB9 


K2 






DB10 


N1 






DB1 1 


K3 






DB12 


P1 






DB13 


L3 






DB14 


R1 






DB15 
DB16 


L2 
P2 


I 


DB port input data bits 0 through 31 


DB17 


M2 






DB18 


M3 






DB19 


N2 






DB20 


N3 






DB21 


N4 






DB22 


P4 






DB23 


N5 






DB24 


P3 






DB25 


P5 






DB26 


R2 






DB27 


N6 






DB28 


R3 






DB29 


P6 






DB30 


R4 






DB31 


P7 






DBSGN 


G13 


I 


Sign magnitude control; high identifies DB input data as two's complement, low identifies DB input 
data as unsigned. 


EA 


H14 


I 


Core multiplier operand select. A high on this signal selects DA register for input on the R bus; a 




low selects the swap MUX. 


EB 


C3 


I 


Core multiplier operand select. A high on this signal selects DB register for input on the S bus; a 




low selects the swap MUX. 


ETPERR 


D15 


0 


Equality check result. A low on this signal indicates that bits 67 through 64 of the core multiplier 




results are equal to bit 63. 


FTO 
FT1 


H13 
G15 


I 


Feedthrough control signals for A, B, I, Pipeline and Y registers (see Table 4). 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


GND C6 
GND C8 
GND C10 
GND D7 
GND E3 
GND E13 
GND F3 
GND F13 
GND N8 




Ground pins. All ground pins should be used and connected. 


MSERR F14 


0 


Master/slave error flag. This signal goes high when the contents of the Y output multiplexer and 
the value at the external port are not equal. 


OEY C13 


I 


Y, YETP2-YETP0, and PY3-PY0 output enable, active low. 


PAO M8 
PA1 R9 
PA 2 P9 
PA3 R10 


I 


Parity input data bus for DA input data 


PBO R5 
PB1 M7 

ran o D£> 

rtiZ no 
PB3 R7 


I 


Parity input data bus for DB input data 


rYU U14 
PY 1 A1 5 
PY2 E14 
PY3 B15 


I/O 




Y output parity data bus. Outputs data from parity generator (OEY = L) or inputs external parity 
data (OEY = H). 


PERRA P8 


0 


DA port parity status pin. Goes high if even-parity test on any byte fails. 


PERRB R8 


0 


DB port parity status pin. Goes high if even-parity test on any byte fails. 


PERRY C15 


0 


Y port parity status pin. Goes high if even-parity test on any byte fails. 


RNDO G12 


I 


Multiplier/accumulator rounding control; high rounds integer result; low leaves result unaltered. 


RND1 El 5 


I 


Multiplier/accumulator rounding control; high rounds fractional result; low leaves result unaltered. 


SELD G1 


I 


D multiplexer select. High selects DA and DB ports; low selects multiplier core output. 


SELREG E1 


I 


Write enable for temporary register and accumulator. High enables the temporary register; low enables 
the accumulator. 


SELY B3 


I 


Y multiplexer select. High selects most significant 32 bits of Y register output; low selects least 
significant 32 bits. 


SGNEXT G2 


I 


Sign extend control for multiplexer. A low fills shift matrix bits 66-64 with zeros; a high fills DA31 
in bits 66-64. 


SFTO F1 
SFT1 G4 


I 


Shift multiplexer control (see Table 4). 


TPO D3 
TP1 D2 


I 


Test pins (see Table 5) 


V CC C5 
V CC C11 
V C C D8 
V CC N7 




Supply voltage (5 V) 


WEMS D1 


I 


Write enable for most significant 32 bits of temporary register and accumulator active low. 


WELS G3 


1 


Write enable for least significant 32 bits of temporary register and accumulator active low. 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



NAME 


NO. 


YO 


C1 


Y1 


F2 


Y2 


B1 


Y3 


E2 


Y4 


C2 


Y5 


C4 


Y6 


B2 


Y7 


B4 


Y8 


A1 


Y9 


B5 


Y10 


A2 


Y11 


A3 


Y12 


B6 


Y13 


A4 


Y14 


A5 


Y15 


C7 


Y16 


A6 


Y17 


B7 


Y18 


A7 


Y19 


A8 


Y20 


B8 


Y21 


A9 


Y22 


C9 


Y23 


A10 


Y24 


D9 


Y25 


A11 


Y26 


B9 


Y27 


A12 


Y28 


A13 


Y29 


B10 


Y30 


A14 


Y31 


B11 


YETPO 


B13 


YETP1 


B12 


YETP2 


B14 



I/O 



DESCRIPTION 



Y port data bus. Outputs data from Y register (OEY = L); inputs data to master/slave comparator 
(OEY = H). 



Data bus for extended precision product. Outputs three most significant bits of the 67-bit multiplier 
core result; inputs external data to master/slave comparator. 



TABLE 1. INSTRUCTION INPUTS 



Signal 


High 


Low 


DASGN 


Identifies DA Input data as two's complement 


Identifies DA input data as unsigned 


DBSGN 


Identifies DB input data as two's complement 


Identifies DB input data as unsigned 


RNDO 


Rounds integer result 


Leaves integer result unaltered 


RND1 


Rounds fractional result 


Leaves fractional result unaltered 


COMPL 


Complements the product from the multiplier 
before passing it to the accumulator 


Passes the product from the multiplier to the 
accumulator unaltered 


ACCO 
ACC1 


See Table 2 


See Table 2 
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SN74ACT8836 

32-BIT BY 32-BIT MULTIPLIER/ACCUMULATOR 



TABLE 2. MULTIPLIER/ ADDER CONTROL INPUTS 



ACC1 


ACCO 


EA 


EB 


Operation 


0 


0 


X 


x 


± (R x S) + 0 


0 


1 


X 


X 


±(R x S) + ACC 




0 


X 


X 


±(R x S) - ACC 




1 


0 


0 


±1x1+0 




1 


0 


1 


±1 x DB + 0 




1 


1 


0 


±DA x 1 +0 




1 


1 


1 


±DA x DB + 0 



ACC is the data stored in the accumulator 



TABLE 3. SHIFTER CONTROL INPUTS 



SFT1 


SFTO 


Shifter Operation 


L 


L 


Pass data without shift 


L 


H 


Shift one bit left; fill with zero 


H 


L 


Swap upper and lower halves of temporary register 


H 


H 


Shift 32 bits right; fill with sign bit 



TABLE 4. FLOWTHROUGH CONTROL INPUTS 



Control Inputs 
FT 1 FTO 


Registers Bypassed 


Pipeline Y I A B 


L L 
L H 
H L 
H H 


Yes Yes Yes Yes Yes 
Yes No No No No 
Yes Yes No No No 
No No No No No 



TABLE 5. TEST PIN CONTROL INPUTS 



TP1 


TPO 


Operation 


L 


L 


All outputs and l/Os forced low 


L 


H 


All outputs and l/Os forced high 


H 


L 


All outputs placed in a high impedance state 


H 


H 


Normal operation (default state) 



data flow 

Two 32-bit input data ports, DA and DB, are provided for input of the multiplicand and multiplier to registers 
A and B and the multiplier/adder. Input data can be clocked to the A and B registers before being passed 
to the multiplier/adder if desired. Two multiplexers, R and S, in conjunction with a flowthrough decoder 
select the multiplier operands from DA and DB inputs, A and B registers, or the temporary register. Data 
is supplied to the temporary register from a shifter that operates on external DA/DB data or a previous 
multiplier/adder result. The 67-bit multiplier/adder result can be output through the Y port or passed through 
the shifter to the accumulator. 

External DA and DB data is also available to the accumulator via the shifter. This 64-bit data can be extended 
with zeros or the sign bit. The 64 least significant bits from the shifter may also be latched in the 64-bit 
temporary register and input to the multiplier through the R and S multiplexers. A swap option allows the 
most significant and least significant 32-bit halves of temporary register data to be swapped before being 
made available to the R and S multiplexers. This allows either 32-bit half of the temporary register to be 
used as a multiplier. 
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SN74ACT8836 

32 BIT BY 32-BIT MULTIPLIER1ACCUMULAT0R 



architectual elements 

Included in the functional block diagram of the 'ACT8836 are the following blocks. 

1 . Two 32-bit registered input data ports DA and DB 

2. A parity checker at the DA and DB inputs 

3. An instruction decoder (I register) 

4. A flowthrough decoder that permits selected registers to be bypassed to support up to three 
levels of pipelining 

5. R and S multiplexers to select operands for the multiplier/ adder from DA and DB inputs, registers 
A and B, or temporary register 

6. A D multiplexer that selects the operand for the shifter from the 67-bit sign-extended DA and 
DB inputs or the multiplier/adder output 

7. A shifter block that operates on DA/DB input data or on multiplier/adder outputs for scaling or 
Newton-Raphson division 

8. A Y output multiplexer that selects the most significant half or the least significant half of the 
multiplier/ adder result for output at the registered Y port 

9. An extended precision error check that tests for overflow 

1 0. A master/slave comparator and parity generator/comparator at the Y output port for master/slave 
and parity checking 

1 1 . Registers at the external data and instruction input ports and the shifter and multiplier/adder 
output port to support pipe-lining 

input data parity checker 

An even-parity check is performed on each byte of input data at the DA, DB and Y ports. If the parity 
test fails for any byte, a high appears at the parity error output pin (PERRA for DA data, PERRB for DB 
data, PERRY for Y data). 

A and B registers 

Register A can be loaded with data from the DA bus, which normally holds a 32-bi t multi plica nd. Re gister 
B is loaded from the DB bus which holds a 32-bit multiplier. Separate clock enables, CKEA and CKEB, allow 
the registers to be loaded separately. This is useful when performing double precision multiplication or 
using the temporary register as an input to the multiplier/adder. The registers can be made transparent 
using the FT inputs (see Table 4). 

instruction register 

Instruction inputs to the device are shown in Table 1 . These signals control signed, unsigned, and mixed 
multiplication modes, fractional and integer rounding, accumulator operati ons a nd complementing of 
products. They can be latched into instruction register I when clock enable CKEI is low. 

Sign control inputs DASGN and DBSGN identify DA and DB input data as signed (high) or unsigned (low). 

Rounding inputs RNDO and RND1 control rounding operations in the multiplier/adder. A low on these inputs 
passes the results unaltered. If a high appears on RND1, the result will be rounded by adding a one to 
bit 30. RND1 should be set high if the multiplier/adder result is to be shifted in order to maintain precision 
of the least significant bit following the shift operation. If a high appears on RNDO, the result will be rounded 
by adding a one to bit 31. This code should be used when the adder result will not be shifted. 

A complement control, COMPL, is used to complement the product from the muliplier before passing it 
to the accumulator. The complement will occur if COMPL is high; the product will be passed unaltered 
if COMPL is low. 

ACC1-A.CC0 control the operation of the multiplier/adder. Possible operations are shown in Table 2. 
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32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PA3-PA0-*- 
PB3-PB0-*- 



PERRA-4— 
PERRB -4- 



DA31-DA0-*- 



CKEA 



CKEI+- 



-fit- 



-'32 



PARITY 
CHECK 
A/B 



\ D MUX f 



A 

REGISTER 




ACCi-ACCO-*/^ 



\ i mux y 



B 

REGISTER 



\ B MUX / 



^ r mux ~y \ s Mux"y ^ 



MULTIPLIER/ADDER STAGE 1 



PIPELINE REGISTER 



MULTIPLIER/ADDER STAGE 2 
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INPUT REGISTERS AND PARITY CHECK 



S, and swap multiplexers 

The R and S multiplexers select the multiplier/adder operands from external data or from the temporary 
register. 

When EA is low, the R multiplexer selects data from the swap multiplexer. When EA is high, the R multiplexer 
selects data from DA or the A register, depending on the state of the flowthrough control inputs (see 
Table 4). When EB is low, the S multiplexer selects data from the swap multiplexer. When EB is high, the S 
multiplexer switches data from DB or the B register, depending on the state of the flowthrough control inputs. 

EA and EB are also used in conjunction with the multiplier/adder control inputs to force a numeric one 
on the R or S inputs (see Table 2). 

The swap multiplexers are controlled by the shifter control inputs. When SFT1 is high and SFTO is low, 
the most significant half of the temporary register is available to the S multiplexer, and the least significant 
half is available to the R multiplexer. When SFT1-SFT0 are set to other values, the most significant half 
of the temporary register is available to the R multiplexer, and the least significant half is available to the 
S multiplexer. 



ultiplier/adder 

The multiplier performs 32-bit multiplication and generates a 67-bit product. The product can be latched 
in the pipeline to increase cycle speed. The product is complemented when COMPL is set high as shown 
in Table 1 . The adder computes the sum or the difference of the accumulator and the product and gives 
a 67-bit sum. Bits 66-64 are used for overflow and sign extension. 
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D multiplexer 

The D multiplexer selects input data for the shifter. Two sources are available to the multiplexer: a 64-bit 
word formed by concatenating DA and DB bus data, and the 67-bit sum from the multiplier/adder. If SELD 
is high, external DA/DB data is selected; if SELD is low, the sum is selected. 

If the 64-bit word is selected for input to the shifter, three bits are added to the word based on the state 
of the sign extend signal (SGNEXT). If SGNEXT is low, bits 66-64 are zero-filled; if SGNEXT is 'high, bits 
66-64 are filled with the value on DA31 . 



temporary register and accumulator (Figure 1) 

Output from the shifter will be stored in the temporary register if SELREG is high and in the accumulator 
register if SELREG is low. The 64-bit temporary register can be used to store temporary data, constants 
and scaled binary fractions. 



Separate clock controls, WELS and WEMS, allow the most significant and least significant halves of the 
shifter output to be loaded separately. The 32 l east sig nificant bits of the se lecte d regist er are loaded when 
WELS is low; the most significant bits when WEMS is low. When WELS and WEMS are both low, the 
entire word from the shifter is loaded into the selected register. 
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FIGURE 1. TEMPORARY REGISTER AND ACCUMULATOR 
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shifter 



The shifter can be used to multiply by two for Newton-Raphson operations or perform a 32-bit shift for 
double precision multiplication. The shifter is controlled by two SFT inputs, as shown in Table 3. 



Y register 

Final or intermediate multiplier/adder results will be clocked into Y register when CKEY is low. 

Results can be passed directly to the Y output multiplexer using flowthrough decoder signals to bypass 
the register (see Table 4). 

Y multiplexer and Y output multiplexer 

The Y multiplexer allows the 64-bit result or the contents of the Y register to be switched to the Y bus, 
depending upon the state of the flowthrough control outputs. The upper 32 bits are selected for output 
when the Y output multiplexer control SELY is high; the lower 32 bits are selected for output when SELY 
is low. Note that the Y output multiplexer can be switched at twice the clock rate so that the 64-bit result 
can be output in one clock cycle, 
flowthrough decoder 

To enable the device to operate in pipelined or flowthrough modes, on-chip registers can be bypassed using 
flowthrough control signals FT1 and FTO. Up to three levels of pipeline can be supported, as shown in 
Table 4. 



FT 1 -FTO » / 2 |FTDECODE| I > 
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FIGURE 2. Y OUTPUT 



(0 
CO 
00 
CO 
H 
O 
< 

r> 
2 



z 
o 

< 
s 

cc 
o 

u. 



UJ 

o 
z 
< 
> 

< 



Texas ^ 
Instruments 

POST OFFICE 80X 655012 • DALLAS. TEXAS 75265 



4-17 



SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 




4-SELY 
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FIGURE 3. OUTPUT ERROR CONTROL 
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Three extended product outputs, YETP2-YETP0, are provided to recover three bits of precision during 
overflow. An extended precision check error signal (ETPERR) goes high whenever overflow occurs. If sign 
controls DASGN and DBSGN are both low, indicating an unsigned operation, the extended precision bits 
66-64 are compared for equality. Under all other sign control conditions, bits 66-63 are compared for 
equality. 

master slave comparator 

A master/slave comparator is pro vided to compare data bytes from the Y output multiplexer with data 
bytes on the external Y port when OEY is high. A comparison of the three extended precision bits of the 
multiplier/adder result or Y register output with external data in the YETP1-YETP0 port is performed 
simultaneously. If the data is not equal, a high signal is generated on the master slave error output pin 
(MSERR). A similar comparison is performed for parity using the PY3-PY0 inputs. This feature is useful 
in fault-tolerant design where several devices vote to ensure hardware integrity. 

test pins 

Two pins, TP1-TP0, support system testing. These may be used, for example, to place all outputs in a 
high-impedance state, isolating the chip from the rest of the system (see Table 5). 

data formats 

The 'ACT8836 performs single-precision and double-precision multiplication in two's complement, unsigned 
magnitude, and mixed formats for both integer and fractional numbers. 

Input formats for the multiplicand (R) and multiplier (S) are given below, followed by output formats for 
the fully extended product. The fully extended product (PRDT) is 67 bits wide. It includes the extended 
product (XTP) bits YETP1-YETP0, the most significant product (MSP) bits Y63-Y32, and the least significant 
product (LSP) bits Y31-Y0. 
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This can be represented in notational form as follows: 
PRDT = XTP : : MSP : : LSP 

or 

PRDT = YETP2 - YETPO : : Y63 - YO 

Table 6 shows the output formats generated by two's complement, unsigned and mixed-mode 
multiplications. 



TABLE 6. GENERATED OUTPUT FORMATS 





Two's Complement 


Unsigned Magnitude 


Two's Complement 


Two's Complement 


Two's Complement 


Unsigned Magnitude 


Two's Complement 


Unsigned Magnitude 



examples 

Representative examples of single-precision multiplication, double-precision multiplication, and division using 
Newton-Raphson binary division algorithm are given below. 

single-precision multiplication 

Microcode for the multiplication of two signed numbers is shown in Figure 1 . In this example, the result 
is rounded and the 32 most significant bits are output on the Y bus. A second instruction (SELY = 0) 
would be required to output the least significant half if rounding were not used. 

Unsigned and mixed mode single-precision multiplication are executed using the same code. (The sign 
controls must be modified accordingly.) Following are the input and output formats for signed, unsigned, 
and mixed mode operations. 



Input Operand A 



Two's Complement Integer Inputs 



Input Operand B 



30 29 



_231 2 30 2 29 2 2 

Sign) 



2 i 20 



-2 31 2 30 2 29 
(Sign) 



22 2 1 2° 



CD 
CO 
00 
00 

h- 
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C/) 



Input Operand A 



Unsigned Integer Inputs 



Input Operand B 



31 30 29 



31 30 29 



231 2 30 2 29 2 2 2 1 2° 



2 31 2 3 ° 2 29 



22 2 1 2° 



Z 

o 
< 



Two's Complement Fractional Inputs 
Input Operand A Input Operand B 



31 30 29 



-2° 2 _1 2 -2 . 2~ 29 2 -30 2~ 31 

iign) 



31 30 29 



-2° 2~ 1 2~ 2 
(Sign) 
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Unsigned Fractional Inputs 

Input Operand A Input Operand B 

31 30 29 2 1 0 | | 31 30 29 . . 2 1 0 ~ 

2-1 2"2 2- 3 2" 30 2" 31 2 32 2~ 1 2 2 2 3 2~ 30 2~ 31 2- 32 



Two's Complement Integer Outputs 



Extended 
Product 
(YETP2-YETP0) 



66 



65 



64 



_266 2 65 2 64 



Most Significant Product 
(Y63-Y32) 



Least Significant Product 
(Y31-Y0) 



263 2 62 2 61 2 34 2 33 2 32 



] [IT 



30 29 2 



231 2 30 2 29 2 2 



2 0 



(Sign) 



Unsigned Integer Outputs 
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Extended 
Product 
(YETP2-YETP0) 

66 65 64 | 
266 2 65 2 64 



Extended 
Product 
(YETP2-YETP0) 

66 65 64 

-2 4 2 3 2 2 



(Sign) 



Extended 
Product 
(YETP2-YETP0) 

66 65 64 
22 2 1 2° 



Most Significant Product 
(Y63-Y32) 



Least Significant Product 
(Y31-Y0) 



63 62 61 



30 31 32 



31 30 29 



1 



263 262 2 61 2 34 2 33 2 32 2 31 2 30 2 29 2 2 2 1 2° 



Two's Complement Fractional Outputs 



Most Significant Product 
(Y63-Y32) 



Least Significant Product 
(Y31-Y0) 



63 62 61 



30 31 32 



2 1 20 2- 



2-28 2~ 2 9 2- 30 2 31 2 32 2~ 33 



Unsigned Fractional Outputs 



2 -60 2~ 61 2~ 62 



Most Significant Product 
(Y63 Y32) 



Least Significant Product 
(Y31-Y0) 



63 62 61 



30 31 32 



1 [ 



31 30 29 



2 2 2- 3 



2-30 2 31 2~ 32 2- 33 2" 34 2~ 35 



2-62 2 -6 3 2" 64 
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double-precision multiplication 

To simplify discussion of double-precision multiplication, the following example implements an algorithm 
using one 'ACT8836 device. It should be noted that even higher speeds can be achieved through the use 
of two 'ACT8836s to implement a parallel multiplier. 

The example is based on the following algorithm where A and B are 64-bit signed numbers. 
Let 

A m = a s ,a62, a61 332 

and 

Aj = 331, 330, 329 30 (ao = LSB) 

Therefore: 

A = (A m x 232) + A| 
Likewise: 

B = (B m x 232) + B| 
Thus: 

A x B = [(A m x 232) + A|j x [(B m x 232) + B |] 

= (A m x B m ) 264 + (A m x B| + A|] x B m )232 + A| x B| 

Therefore, four products snd three summstions with rank adjustments are required. 

Bssic implementstion of this slgorithm uses 3 single 'ACT8836. The result is a two's complement 1 28-bit 
product. Microcode signsls to implement the slgorithm sre shown in Figure 4. 

The first instruction cycle computes the first product, A| x B|. The lesst significsnt hslf of the result is 
output through the Y port for storsge in an externsl RAM or some other 32-bit register; this will be the 
lesst significsnt 32-bit portion of the final result. 

The instruction also uses the shifter to shift the A| x B| product 32 bits to the right in order to adjust 
for ranking in the next multiplication-sddition sequence. The lesst significsnt half of the shift result is stored 
in the lower 32-bit portion of the sccumulstor; the upper 32 bits contsin the zero snd fill. 

The second instruction produces the second product, A| x B m , sdds it to the contents of the sccumulstor, 
snd stores the result in the sccumulstor for use in the third instruction. 

Instruction 3 computes A m x B|, sdds the result to the sccumulstor, snd outputs the lesst significsnt 
32 bits of the addition for use ss bits 63-32 of the final product. 

This instruction slso shifts the result 32 bits to the right to provide the necesssry rank sdjustment snd 
stores the shift result (the most significsnt half of the sddition result) in the lower 32 bits of the sccumulstor. 
Bits ACC63-ACC32 are filled with zeros; the sign is extended into the three upper bits (ACC66-ACC64). 

Instruction 4 computes the fourth product (Am x Bm), sdds it to the sccumulstor, snd outputs the lesst 
significsnt half at the Y port for use as bits 95-64 of the final product. 

This example assumes that the chip is operating in feed-through mode. A fifth instruction is therefore required 
to perform the fourth iteration sgsin so thst bits 127-96 of the final product can be output. 
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Example 1 . Single Precision Multiply, 32-Bit Result 



Operand 
Select 
R bus 
S bus 
EA EB 


Instruction inputs 


D-MUX 
Select 

SELD 


Sign 
Extend 
SGIMEXT 


Shift-MUX 
Control 
SFT1 SFTO 


Register 
Load 
Select 

SELREG 


Register 
Write 
Enable 

WEH WET 


Feed- 
through 
Control 
FT1 FTO 


Clock Enables 

I A B Y 
CKEl CKEA CKEB CKEY 


Y-MUX 
Select 
SELY 


Y/PY 
Output 
Enable 

OEY 


Sign 
DASGN DBSGN 


Rounding 
Control 

RND1 RNDO 


Product 
Comple- 
ment 

COMPL 


Multiplier/ 
Adder 
Mode 
ACC1 ACCO 


1 1 


1 1 


0 1 


0 


0 0 


0 


X 


0 0 


0 


0 0 


0 0 


0 0 0 0 


1 


o" 



Example 2. Double-Precision Multiply, 64-Bit Result 





Operand 
Select 
R bus 
S bus 


Instruction Inputs 




















Instruction 
Number 


Sign 


Rounding 
Control 


Product 
Comple- 
ment 


Multiplier/ 
Adder 
Mode 


D-MUX 
Select 


Sign 
Extend 


Shift-MUX 
Control 


Register 
Load 
Select 


Register 
Write 
Enable 


Feed- 
through 
Control 


Clock Enables 
I A B Y 


Y-MUX 
Select 


Y/PY 
Output 
Enable 




EA EB 


OASGN DBSGN 


RND1 RNDO 


COMPL 


ACC1 ACCO 


SELD 


SGNEXT 


SFT1 SFTO 


SELREG 


WEH WlL 


FT1 FTO 


CKEl CKEA CKEB CKEY 


SELY 


OEY 


(1) 




0 0 


0 0 


0 


0 0 


0 


0 


1 1 


0 


0 0 


0 0 




0 


0 


(2) 




0 1 


0 0 


0 


0 1 


0 


0 


0 0 


0 


0 0 


0 0 




X 
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(3) 




1 0 


0 0 


0 


0 1 
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0 


1 1 


0 


0 0 


0 0 
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(4) 




1 1 


0 0 
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0 1 
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0 0 
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X X 


0 0 
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(5) 




1 1 


0 0 
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0 1 
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0 0 
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X X 


0 0 
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Example 3. Newton-Raphson Division 





Operand 
Select 
R bus 
S bus 


Instruction Inputs 


























Instruction 
Number 


Sign 
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Sign 
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Shift-MUX 
Control 
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through 
Control 
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Clock Enables 
A B 
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Output 
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DASGN DBSGN 
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Repeat N Times* 
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0 0 
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1 0 
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0 1 
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End Repeat 




































(3) 


0 1 
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1 1 
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1 o 


1 0 
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0 
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(4) 


0 0 
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0 0 


0 
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0 1 
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0 1 


1 0 
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0 



*N - Where m = number of bits in the seed (assuming 32-bits of precision) 

2m +1 



FIGURE 4. MICROCODED EXAMPLES 
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Newton-Raphson binary division algorithm 

The following explanation illustrates how to implement the Newton-Raphson binary division algorithm using 
the 'ACT8836 multiplier/accumulator. The Newton-Raphson algorithm is an iterative procedure that 
generates the reciprocal of the divisor through a convergence method. 

Consider the equation Q = A/B. This equation can be rewritten as Q = A x (1/B). Therefore, the quotient 
Q can be computed by simply multiplying the dividend A by the reciprocal of the divisor (B). Finding the 
divisor reciprocal 1/B is the objective of the Newton-Raphson algorithm. 

To calculate 1/B the Newton-Raphson equation, Xi+ 1 = Xi(2-BXi) is calculated in an iterative process. 
In the equation, B represents the divisor and X represents successively closer approximations to the 
reciprocal 1 1B. The following sequence of computation illustrates the iterative nature of the Newton-Raphson 
algorithm. 

Step 1 X1 = XO(2-BXO) 

Step 2 X2 = XK2-BX1) 
Step 3 X3 = X2(2-BX2) 

Step n Xn = Xn-1 (2-BXn-1 ) 

The successive approximation of Xi, for all i, approaches the reciprocal 1/B as the number of iterations 
increases; that is 

1im Xi = 1/B 
i -* n 

The iterative operation is executed until the desired tolerance or error is reached. The required accuracy 
for 1/B can be determined by subtracting each xi from its corresponding xi + 1. If the difference |Xi + 1 
- Xi | is less than or equal to a predetermined round off error, then the process is terminated. The desired 
tolerance can also be achieved by executing a fixed number of iterations based on the accuracy of the 
initial guess of 1/B stored in RAM of PROM. 

The initial guess, XO, is called the seed approximation. The seed must be supplied to the Newton-Raphson 
process externally and must fall within the range of 0 <X0 <2/B if B is greater than 0 or 2/B <X0<0 
if B is less than 0. 

To perform the Newton-Raphson binary division algorithm using the 'ACT8836, the divisor, B, must be 
a positive fraction. As a positive fraction, B is limited within the range of 1/2 < B < 1. 

Since Xi from Newton-Raphson must lie between 0 < Xi < 2/B and since the range of the positive fraction 
B is 1/2 < B < 1, then the limits of Xi become 1 < Xi <2. 

The range of - BXi will therefore be -2 < -BXi < - 1/2. 

The limits of - BXi are shown in Table 7 as they would appear in the 'ACT8836 extended bit, binary fraction 
format. 

TABLE 7. LIMITS OF -BXj IN ACT8836 EXTENDED BIT FORMAT 





Extended Bits 
66 65 64 


63 


62 


61 ... 
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0 


-2 


1 1 1 
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0 


- y 2 


1 1 1 
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0 ... 
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The diagram indicates that -BXi is always of the form: 

1 1 1 dO . d1 d2 dn-2 dn-1 
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The next step in Newton-Raphson is to complete the 2 - BXi equation. The fractional representation of 2 is: 
0 0 1 0.00 ...........0 0 

Completion of the 2 -BXi equation is shown in Table 8. 

TABLE 8. COMPLETION OF 2- BXj EQUATION 



Extended Bits 
66 65 64 


63 


62 


61 . . . 


1 0 


1 


1 


1 


d 0 


dl 


d 2 ... 


•■• d n _2 d n _i 


+ 0 


0 


1 


0 


0 


0 


. . . 0 0 


= o 


0 


0 


do 


dl 


d 2 ... 


• • • d n _2 d n _v 



Since this step only affects the extended bits (66-64) on the 'ACT8836, this step can be skipped. The 
following algorithm can therefore be used to perform Newton-Raphson binary division with the 'ACT8836. 

Assuming B is on the DB bus (or stored in the B register) and Xi is stored in the temporary register: 

Step 1 



Accumulator *- -(DB x temporary register) 
= 2 -BXi 



Step 2 



Temporary Register *- Left shift one bit of 
(accumulator times temporary register) 
= Xi+1 

= Xi (2 - BXi) 



Step 3 



Repeat Steps 1 and 2 until | Xj + 1 



a predetermined round-off error 



Two cycles are required for each iteration. The left shift that is performed in Step 2 is required to realign 
Xi after the signed fraction multiply. Microcode for this example is shown in Figure 4. 
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absolute maximum ratings over operating free-air temperature range (unless otherwise noted) 1. 



Supply voltage, Vqc • -0.5 V to 6 V 

Input clamp current, l|K (Vj<0 or V|>Vcc) ±20 mA 

Output clamp current, IrjK (Vo<0 or Vo>VcC) ....... ±50 mA 

Continuous output current, Irj (Vfj = 0 to Vqc) ±50 mA 

Continous current through Vcc or GND pins ±100 mA 

Operating free-air temperature range 0°C to 70 °C 

Storage temperature range . -65°C to 150°C 



Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 
only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 



recommended operating conditions 





MIN NOM MAX 


UNIT 


Vcc Supply voltage 


4.5 5 5.5 


V 


V|(-f High-level input voltage 


2 V CC 


V 


V|[_ Low-level input voltage 


0 0.8 


V 


'OH High-level output current 


-8 


mA 


IqL Low-level output current 


8 


mA 


V| Input voltage 


o V CC 


V 


Vq Output voltage 


0 v C c 


V 


dt/dv Input transition rise or fall rate 


0 15 


ns/V 


T*a Operating free-air temperature 


0 70 


°C 



electrical characteristics over recommended operating free-air temperature range (unless otherwise 
noted) 



PARAMETER 


TEST CONDITIONS 


v C c 


T A - 25 °C 


T A - 0°C to 70°C 


UNIT 


MIN TYP MAX 


MIN MAX 


VOH 


lOH = -20 M 


4.5 V 


4.4 


4.4 


V 
V 


5.5 V 


5.4 


5.4 


lOH = - 8 mA 


4.5 V 


3.8 


3.7 


5.5 V 


4.8 


4.7 


vol 


lOL = 20 tiA 


4.5 V 


0.1 


0.1 


V 
V 


5.5 V 


0.1 


0.1 


lOL ~ 8 mA 


4.5 V 


0.32 


0.4 


5.5 V 


0.32 


0.4 


'i 


Vj = Vcc or 0 


5.5 V 


0.1 


±1.0 


HA 


! cc 


V| = V C C or o. ! 0 


5.5 V 


50 


100 




Ci 


V| = V C C or 0 


5 V 


5 10 


10 


pF 




One input at 3.4 V, 
other inputs at 0 or Vcc 


5.5 V 


1 


1 


mA 


'OZH 


V| = V C c or 0 


5 V 


0.5 


5 


tiA 


'OZL 


V| = V C C or 0 


5 V 


-0.5 


-5 


nA 



This is the increase in supply current for each input that is at one of the specified TTL voltage levels rather than 0 or Vcc- 
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SN74ACT8836 

32-BIT BY 32-BIT MULTIPLIER/ACCUMULATOR 



setup and hold times 



PARAMETER 


MIN 


MAX 


UNIT 




Instruction before CLKT 


14 




t su 2 


Data before CLKT 


12 




t su 3 


C"KEA before CLKT 


14 






CKEB before CLKT 


14 




t su 5 


CKEI before CLKT 


10 




^6 




19 




t su 7 


SELREG before CLKT 


12 




t su8 


WEM3 before CLKT 


11 




*su9 


WlL§ before CLKT 


11 


ns 


th1 


Instruction after CLKT 


0 




th2 


Data after CLKT 


0 




th3 


CKEA after CLKT 


0 




*h4 


CKEB after CLKT 


0 




l h5 


CKEI after CLKT 


0 




th6 


CKEY after CLKT 


0 




th7 


SELREG after CLKT 


0 




th8 


WEMS after CLKT 


0 




th9 


WELS" after CLKT 


0 





to 
z 

•si 
> 

o 

H 

00 
00 
W 
(Si 



> 

< 
> 
z 
o 



o 

33 
> 
O 
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SN74ACT8836 

32-BIT BY 32-BIT MULTIPLIER/ACCUMULATOR 



switching characteristics over recommended ranges of supply voltage and free-air temperature (see 
Figure 2) for load circuit and voltage waveforms) 



PARAMETER 


FROM 
(INPUT) 


TO 
(OUTPUT) 


FT MODE (FT1-FT0) 


MIN TYP MAX 


UNIT 


tpdl 1 


CLK 


PIPE 


11 


36 




tpd2 f 


PIPE 


Y REG 


11 


36 




tpd3 f 


PIPE 


ACCUM 


11 


36 




t p d4 r 


Y REG 


Y 


All modes 


18 




*pd5 


SELY 


Y 


All modes 


18 




tpd6 f 


CLK 


Y REG 


01 


54 






CLK 


ACCUM 


10 or 01 


67 




tprjg 


CLK 


Y 


10 


67 




tpd9 


DATA 


Y 


00 


60 




tpdlO^ 


DATA 


ACCUM 


00 


56 




*pd11 


CLK 


YETP 


1 1 or 10 


18 




tpd 1 2 


CLK 


ETPERR 


1 1 or 1 0 


18 




*pd1 3 


CLK 


YETP 


00 


67 




^pdl 4 


CLK 


ETPERR 


01 


67 




tprj-J 5 


DATA 


YETP 


00 


60 




tpd16 


DATA 


ETPERR 


00 


60 




tpd17 


PA 


PERRA 


All modes 


20 




tpd18 


DA 


PERRA 


All modes 


20 




tpd19 


PB 


PERRB 


All modes 


20 




tpd20 


DB 


PERRB 


All modes 


20 




*pd21 


PY 


PERRY 


All modes 


20 




tpd22 


Y 


MSERR 


All modes 


22 




*pd23 


YETP 


MSERR 


All modes 


22 




ten2 


Uey" 


YETP 


All modes 


20 




*en1 


Uey 


Y 


All modes 


20 




*dis1 


0"EY 


YETP 


All modes 


15 




*dis2 


OEY 


Y 


All modes 


15 




clock requirements 


PARAMETER 


SN74ACT8836 


UNIT 


MIN MAX 


t w i CLK high 


5 


ns 


t w 2 CLK low 


20 



t These parameters cannot be measured but can be inferred from device operation and other measurable parameters. 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PARAMETER MEASUREMENT INFORMATION 



CLK 1///////////////////////////////////////////////////////////////////^ 

CKEA. CKEB__ ; ; , ; ; , ; , 

CKEI, CKEY — 

I 



INSTR ZZXZ 



DATA 22X1 



SELY. 



-t p d9- 



-«en2 



wvvv 



^7777 



-tpd5 



tdis2 



Y3i -yo v\vvvvvvvvvvvvvvv\\\\\\\\\^\vvvvvvvvvvvvv-^ ir 




FIGURE 5. FULL FLOWTHROUGH MODE (FT - 00) 



CO 

z 

> 

O 
H 
00 
00 
CJ 



> 

< 
> 
z 
o 

m 



m 
O 

> 

Hj 

O 



CKEA, CKEB 



23T 
instr 



data!2C 



l 



SELREG 
WEMS, WELS 



I 



-th7"th9- 



SUM-OF- — V- 
PRODUCTT 



DC 

i 



-tpd10- 



SELY L_ 



I ' 



\\\\ 



{7777 



~t p d9- 



tpd5- 



tdis2 



Y31-Y0 \\\\\\\\ \ \ \\\\\\\\ \\\\\\\\\\\\ \\\\ \\\\\ WT \ X ^ : 



zx: 



dig 



FIGURE 6. FULL FLOWTHROUGH MODE, ACCUMULATOR MODE (FT - 00) 
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SN74ACT8836 

32-BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PARAMETER MEASUREMENT INFORMATION 



CLK 



CKEA, CKEB 
CKEI, CKEY 



K~ tsu3tsu6->» 



INSTR ZDC 



-*su1- 



dataIDC 



-t S u2- 



PRODUCT . 




-H— th3-th6 



-th1- 



-th2- 



-t pd 6- 



SELY . 



I 



N * t pd4 

I 



^7777 £ 



tdis2-M-« 



tpd5-»*- 



t en 2-H H 

FIGURE 7 FLOWTHROUGH PIPE ONLY VOLTAGE WAVEFORMS (FT - 01) 



CO 
CO 
00 
00 
H 

o 
< 

z 



O 

< 

cc 
O 

LL 



kJU 
O 



< 

> 

< 



Texas 
Instruments 

POST OFFICE BOX 655012 • DALLAS, TEXAS 75265 



4-29 



SN74ACT8836 

32-BIT BY 32-BIT MULTIPLIER/ACCUMULATOR 



PARAMETER MEASUREMENT INFORMATION 



CLK 



CKEA. CKEB 
CKEI, CKEY 



instrSXI 



-+I ^t su 3-t su6 



DATA SIC 



t4-*SUl 
|4 

^ S 



-th1- 



W-t su 2 



3C 

i 



-th2- 



xxxxxxxxxxx y 



tsu7-tsu9-U- 



SUM-OF- 
PRODUCT 




-tpd7~ 



-t pt J6- 



z 

> 
o 

■H 
00 
00 
CO 

o 



oey 



zx 

DC 



/;/;;/;//;///;////;;//////// ///;;/// 



i 



-*h7-th9- 



~*pd4 



Y31 -Y0 XXXXXXXXXXXXXXXXXXXXxi* lsP" 



th3-th6- 



ZXZEI 



DC 



x: 



^ /////////////////////// 



ten2-M- 



I 

-*-*pd5 



I 



I | 

//;;/////////;/////// 

i 



FIGURE 8. FLOWTHROUGH PIPE ONLY, ACCUMULATOR MODE (FT - 01} 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PARAMETER MEASUREMENT INFORMATION 



CLK 



CKEA, CKEB 
CKEI 



INSTR 3IC 



DATA^IC 



-tsuS-tsuS- 



-tsul- 



-tsu2- 



Vl-th3-th5-^ 



-thi- 



-th2- 



oey \ \\ \ \\\\\\\ \ \\\\\S \\\S\\S\\\\\S\\S\>^ 



-ten2 



-tpd8- 



U N-tpd5 



tdis2 



ysi-yonSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSK: 



FIGURE 9. FLOWTHROUGH PIPE AND Y ONLY (FT - 10} 



3<5 



CKEA. CKEB _i j 
CKEI | 



M~t su 3-t SU 5 



INSTR 3X1 



k-tsu1 

k t h1 _ 



T 

*-M-tsu2 



_t h2 - 



i 
i 
I 



I 

PBnn,?rT //////////////////XL 



-*pd7- 



SELY. 



-tpd8- 



t S u7t S u9 j 
*h7-th9 ^ 



♦Men2 
M l pd5-W 



<?///////////////< 



t dis2-k—W 



th3"th5 



V//////////////////// 



if////////////////////////// 



Y31-Y0 ////////////////zzzk: 



DC 



FIGURE 10. FLOWTHROUGH PIPE AND Y ONLY, ACCUMULATOR MODE (FT * 10) 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PARAMETER MEASUREMENT INFORMATION 



CLK . 

CKEA, CKEB 
CKEI, CKEY 

INSTR 



k~t SU 3-t su 6 



ssLL 



—♦I |*-^SUl 

k— t h1 - 



r>ATA~X t A,B Ml 



w t h2 — ^ I 

INTR. PRODUCT SSS^SSSSSSSSSC 



3EI 



k t pd1 * 

PRODUCT 



SELY. 



I 



I 



I I | V»-tpd4-H 

I I I I I 

Y31-Y0 ^^\\\\\\\\\SSS^ 



I 



z 

> 

o 

■ H 
00 
00 
00 

« 



\*— «en2— « 




«h3"«h6- 



-*Hp d5 



»-<dis2 



FIGURE 11. ALL REGISTERS ENABLED (FT - 11) 
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SN74ACT8836 

32 BIT BY 32 BIT MULTIPLIER/ACCUMULATOR 



PARAMETER MEASUREMENT INFORMATION 



CLK. 



CKEA, CKEB j 
CKEI, CKEY -|- 
4- 



INSTR DK 



|4 t h1 »l I 

DATA- yj— a. B( i> 



IIMTR. PRODUCT 



f — -Wi 
SELREG vv^vvvv^ 




WEMS, WELS ' 



| t S u7-*su9 

PRODUCT^^^^^^ 

t pd3 ^- 



I 



I 



product ^s\\s\sss\s\\\ss\s\ss\>cz3ae: 



I 



tpd2H*- 



oey ss^sssssss^ 



1 i l«-w-tpd4 

! tpd5-*«-»t 

Y31-Y0 ispi X ^£ 



N-th7th9-*< 



"* X accum" 



i 



th3 th6-H p- 

li2S 



-4- 



I 



-HJ-t en2 



t p d5 



FIGURE 12. ALL REGISTERS ENABLED, ACCUMULATOR MODE (FT 

-Vcc 



11) 



FROM output 

UNDER TEST 




PARAMETER 






s 12 


S2 


ten 


tPZH 


1 kSl 


50 pF 


OPEN 


CLOSED 


tPZL 


CLOSED 


OPEN 


*dis 


«PHZ 


1 kfi 


50 pF 


OPEN 


CLOSED 


tpLZ 


CLOSED 


OPEN 


*pd 




50 pF 


OPEN 


OPEN 



*C|_ includes probe and test fixture capacitance 
LOAD CIRCUIT 

input pulses are supplied by generators having the following characteristics: PRR < 1 MHz, Z ou t = 50 S3, t r = 

FIGURE 13. LOAD CIRCUIT 



50 li, tf = 6 ns. 
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SN74ACT8818 16-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 



roHAIII 



SN74ACT8836 32- x 32-Bit Parallel Multiplier 




SIM74ACT8837 64-Bit Floating Point Processor 



SN74ACT8841 Digital Crossbar Switch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 




Support 



Mechanical Data 
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SN74ACT8837 
64-Bit Floating Point Unit 

• Multiplier and ALU in One Chip 

• 60-ns Pipelined Performance 

• Low-Power EPIC™ CMOS 

• Meets IEEE Standard for 32- and 64-Bit Multiply, 
Add, and Subtract 

• Three-Port Architecture, 64-Bit Internal Bus 

• Pipelined or Flowthrough Operation 

• Floating Point-to-integer and Integer-to-Floating 
Point Conversions 

• Supports Division Using Newton-Raphson 
Algorithm 

• Parity Generation/Checking 

The SN74ACT8837 single-chip floating point processor performs high-speed 32- jv* 
and 64-bit floating point operations. More than just a coprocessor, the 'ACT8837 00 
integrates on one chip, two double-precision floating point functions, an ALU 
and multiplier. K 

O 

The wide dynamic range and high precision of floating point format minimize ^ 
the need for scaling and overflow detection. Computationally-intense ^ 
applications, such as high-end graphics and digital signal processing, need double- z 
precision floating point accuracy to maintain data integrity. Floating point C/> 
processors in general-purpose computing must often support double-precision 
formats to match existing software. 

By integrating its two functions on one chip, the 'ACT8837 reduces data routing 
problems and processing overhead. Its three data ports and 64-bit internal bus 
structure let the user load two operands and take a result in a single clock cycle. 



EPIC is a trademark of Texas Instruments Incorporated. 
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Introduction 



Each of these floating point units (FPU), the SN74ACT8837 combines a multiplier and 
an arithmetic-logic unit in a single microprogrammable VLSI device. The 'ACT8837 
is implemented in Texas Instruments one-micron CMOS technology to offer high speed 
and low power consumption in an FPU with exceptional flexibility and functional 
integration. The FPU can be microprogrammed to operate in multiple modes to support 
a variety of floating point applications. 

The 'ACT8837 is fully compatible with the IEEE standard for binary floating point 
arithmetic, STD 754-1985. This FPU performs both single- and double-precision 
operations, including division and square-root using the Newton-Raphson algorithm. 

Understanding the 'ACT8837 Floating Point Unit 

To support floating point processing in IEEE format, the 'ACT8837 may be configured 
for either single- or double-precision operation. Instruction inputs can be used to select 
three modes of operation, including independent ALU operations, independent multiplier 
operations, or simultaneous ALU and multiplier operations. 

Three levels of internal data registers are available. The device can be used in 
flowthrough mode (all registers disabled), pipelined mode (all registers enabled), or 
in other available register configurations. An instruction register, a 64-bit constant 
register, and a status register are also provided. ^ 

CO 

The FPU can handle three types of data input formats. The ALU accepts data operands CO 
in integer format or IEEE floating point format. In the 'ACT8837, integers are converted . £2 
to normalized floating point numbers with biased exponents prior to further processing. (J 
A third type of operand, denormalized numbers, can also be processed after the ALU *t 
has converted them to "wrapped" numbers, which are explained in detail in a later 
section. The 'ACT8837 multiplier operates only on normalized floating-point numbers Z 
or wrapped numbers. ^ 

Microprogramming the 'ACT8837 

The 'ACT8837 is a fully microprogrammable device. Each FPU operation is specified 
by a microinstruction or sequence of microinstructions which set up the control inputs 
of the FPU so that the desired operation is performed. 
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The microprogram which controls operation of the FPU is stored in the microprogram 
memory (or control store). Execution of the microprogram is controlled by a 
microsequencer such as the Tl SN74ACT8818 16-bit microsequencer. A discussion 
of microprogrammed architecture and the operation of the 'ACT8818 is presented 
in this Data Manual. 

Support Tools 

Texas Instruments has developed a functional evaluation model of the 'ACT8837 in 
software which permit designers to simulate operation of the FPU. To evaluate the 
functions of an FPU, a designer can create a microprogram with sample data inputs, 
and the simulator will emulate FPU operation to produce sample data output files, as 
well as several diagnostic displays to show specific aspects of device operation. Sample 
microprogram sequences are included in this section. 

Texas Instruments has also designed a family of low-cost real-time evaluation modules 
(EVM) to aid with initial hardware and microcode design. Each EVM is a small self- 
contained system which provides a convenient means to test and debug simple 
microcode, allowing software and hardware evaluation of components and their 
operation. 

At present, the 74AS-EVM-8 Bit-Slice Evaluation Module has been completed, and 
a 1 6-bit EVM is in an advanced stage of development. EVMs and support tools for 
El devices in the VLSI family are planned for future development. 

Design Support 

c/> 

2 Texas Instruments Regional Technology Centers, staffed with systems-oriented 

*sl engineers, offer a training course to assist users of Tl LSI products and their application 

^ to digital processor systems. Specific attention is given to the understanding and 

O generation of design techniques which implement efficient algorithms designed to 

H match high-performance hardware capabilities with desired performance levels. 

00 

00 

Ca) 
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Information on VLSI devices and product support can be obtained from the following 
Regional Technology Centers: 



Atlanta 

Texas Instruments Incorporated 
3300 N.E. Expressway, Building 8 
Atlanta, GA 30341 
404/662-7945 

Boston 

Texas Instruments Incorporated 
950 Winter Street, Suite 2800 
Waltham, MA 02154 
617/895-9100 

Northern California 
Texas Instruments Incorporated 
5353 Betsy Ross Drive 
Santa Clara, CA 95054 
408/748-2220 

Design Expertise 

Texas Instruments can provide in-depth technical design assistance through 
consultations with contract design services. Contact your local Field Sales Engineer 
for current information or contact VLSI Systems Engineering at 214/997-3970. 



Chicago 

Texas Instruments Incorporated 

51 5 Algonquin 

Arlington Heights, IL 60005 

312/640-2909 

Dallas 

Texas Instruments Incorporated 
10001 E. Campbell Road 
Richardson, TX 75081 
214/680-5066 

Southern California 

Texas Instruments Incorporated 

17891 Cartwright Drive 

Irvine, CA 92714 

714/660-8140 



CO 
00 
00 
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'ACT8837 Logic Symbol 



CLK 
CLKC - 

CLKMODE - 
BYTEP - 
CONFIG1-0 - 
FAST - 

RND1-0 - 
SRCC - 

SELOP7-0 - 
SELST1-0 - 
SELMS/LS - 



c/> 

> 

o 

H 

00 
00 
W 



10 - 

11 - 

12 - 
■3- 

14 - 

15 - 

16 - 

17 - 

18 - 

19 - 

ENRA - 
ENRB - 
OES - 
OEC - 
OEY - 



ACT8837 
64-Bit Floating Point Unit 



> MASTER CLOCK (EXCEPT C REGISTER) 

> C REGISTER CLOCK 



CLEARS STATES 
& STATUS 

STALLS OPERATION 



CLOCK EDGE 
PARITY GENERATION 
DATA SOURCE 
SUDDEN 
GRADUAL 



UNDER 
FLOW 



ROUNDING MODE 
MULTIPLIER | 

C RE 

ALU I 

OPERAND SOURCE 
STATUS SOURCE 
MSH ] 
LSH 



Y BUS 



INSTRUCTION, RA, & RB 
REGISTERS 



ALU AND MULTIPLIER 
PIPELINE REGISTERS 



STATUS, P, AND S 
REGISTERS 



PARITY 

I/O 



STATUS 
PARITY 



TEST PINS 
0 



INSTRUCTIONS 



FLOWTHROUGH 
EN 

FLOWTHROUGH 
EN 

FLOWTHROUGH 
EN 

DA DATA 
DB DATA 
Y BUS 

DA DATA 
DB DATA 
MASTER/SLAVE 
COMPARATOR 



9 

LOAD RA REGISTER 
LOAD RB REGISTER 
EXCEPTION & OTHER STATUS 
COMPARISON STATUS 
Y31-Y0, PY3-PY0 



COMPARISON 
STATUS 



EXCEPTION 
AND 
OTHER 
STATUS 



• RESET 

• HALT 



PIPESO 



-< PIPES 1 



- PA3-0 

- PB3-0 

- PY3-0 

- PERRA 

- PERRB 

- MSERR 

- UNORD 

- AGTB 

- AEQB 

- IVAL 

- INEX 

- OVER 

- UNDER 

- DENORM 

- DENIN 

• RNDCO 

SRCEX 

CHEX 

■ STEX1- 
STEXO 



DA31 - 
DBO - 



31 
0 



:> 
0> 



< DATA > 
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'ACT8837 Pin Descriptions 

Pin descriptions and grid allocations for the 'ACT8837 are given on the following pages. 



208 PIN . . . GB PACKAGE 
{TOP VIEW) 

2 3 4 5 6 7 8 9 10 1112 13 14 15 16 17 



Table 1 . 'ACT8837 Pin Grid Allocations 





PIN 




PIN 




PIN 




PIN 




PIN 




PIN 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


A1 


NIC 


C2 


YO 


E3 


FAST 


J15 


NC 


P1 


NC 


S1 


NC 


A2 


NC 


C3 


Y3 


E4 


GND 


J16 


SRCC 


P2 


PIPESO 


S2 


PBO 


A3 


Y5 


C4 


Y6 


E14 


GND 


J17 


BYTEP 


P3 


RESET 


S3 


DBO 


A4 


Y8 


C5 


Y9 


E15 


AGTB 


K1 


SEL0P3 


P4 


PB1 


S4 


DB4 


A5 


Y1.1 


C6 


Y12 


E16 


AEQB 


K2 


SEL0P4 


P5 


DB1 


S5 


DB1 1 


A6 


Y14 


C7 


Y15 


E17 


MSERR 


K3 


SEL0P5 


P6 


DB5 


S6 


DB12 


A7 


Y17 


C8 


Y18 


F1 


15 


K4 


GND 


P7 


DB9 


S7 


DB15 


A8 


Y20 


C9 


Y23 


F2 


13 


K14 


GND 


P8 


DB16 


S8 


DB19 


A9 


Y21 


C10 


Y26 


F3 


RNDO 


K15 


PA1 


P9 


DB21 


S9 


DB23 


A10 


Y24 


C11 


Y30 


F4 


GND 


K16 


PA2 


P10 


DB28 


S10 


DB26 


A11 


Y27 


C12 


PY1 


F14 


GND 


K17 


PA3 


P11 


DAO 


S11 


DB30 


A12 


Y29 


C13 


UNDER 


F15 


PERRA 


L1 


SEL0P6 


P12 


DA4 


S12 


DA2 


A13 


PYO 


C14 


INEX 


F16 


OEY 


L2 


SEL0P7 


P13 


DA8 


S13 


DA6 


A14 


PY3 


C15 


DENIN 


F17 


OES 


L3 


CLK 


P14 


DA12 


S14 


DA10 


A15 


IVAL 


C16 


SRCEX 


G1 


17 


L4 


v C c 


P15 


DA19 


S15 


DA14 


A16 


NC 


C17 


CHEX 


G2 


16 


L14 


GND 


P16 


DA22 


S16 


DA15 


A17 


NC 


D1 


11 


G3 


14 


L15 


DA30 


P17 


DA23 


S17 


DA17 


B1 


NC 


D2 


RND1 


G4 


vcc 


L16 


DA31 


R1 


PIPES1 


T1 


NC 


B2 


Y2 


D3 


Y1 


G14 


vcc 


L17 


PAO 


R2 


HALT 


T2 


PB3 


B3 


Y4 


D4 


GND 


G15 


OEC 


M1 


ENRB 


R3 


PB2 


T3 


DBS 


84 


Y7 


D5 


vcc 


G16 


SELMS/LS 


M2 


ENRA 


R4 


DB2 


T4 


DB7 


B5 


Y10 


06 


GND 


G17 


TP1 


M3 


CLKC 


R5 


DB6 


T5 


DB8 


B6 


Y13 


D7 


GND 


H1 


19 


M4 


GND 


R6 


DB10 


T6 


DB13 


B7 


Y16 


08 


vcc 


H2 


NC 


M14 


v C c 


R7 


DB14 


T7 


DB17 


B8 


Y19 


D9 


GND 


H3 


18 


M15 


DA27 


R8 


DB18 


T8 


DB20 


B9 


Y22 


D10 


GND 


H4 


GND 


M16 


DA28 


R9 


DB22 


T9 


DB24 


B10 


Y25 


011 


vcc 


H14 


GND 


M17 


DA29 


R10 


DB27 


T10 


DB25 


81 1 


Y28 


D12 


GND 


H15 


TPO 


N1 


CONFIGO 


R11 


DB31 


T11 


DB29 


B12 


Y31 


D13 


GND 


H16 


SELST1 


N2 


C0NFIG1 


R12 


DA3 


T12 


DAI 


B13 


PY2 


D14 


vcc 


H17 


SELSTO 


N3 


CLKMODE 


R13 


DA7 


T13 


DA5 


B14 


OVER 


D15 


STEX1 


J1 


SEL0P2 


N4 


PIPES2 


R14 


DA11 


T14 


DA9 


B15 


RNDCO 


D16 


STEXO 


J2 


SEL0P1 


N14 


DA18 


R15 


DA16 


T15 


DA13 


B16 


DENORM 


D17 


UNORD 


J3 


SELOPO 


N15 


DA24 


R16 


DA20 


T16 


NC 


B17 


NC 


E1 


12 


J4 


vcc 


N16 


DA25 


R17 


DA21 


T17 


NC 


C1 


PERRB 


E2 


10 


J14 


vcc 


N17 


DA26 











CO 

oo 
oo 
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Table 2. 'ACT8837 Pin Functional Description 



PIN 

NAME NO. 



I/O 



DESCRIPTION 



AEQB E16 



I/O 



Comparison status 1 zero detect pin. When high, 
indicates that A and B operands are equal during a 
compare operation in the ALU. If not a compare, a 
high signal indicates a zero result. 



AGTB E15 



I/O 



Comparison status pin. When high, indicates that A 
operand is greater than B operand. ■ 



BYTEP J17 



When high, selects parity generation for each byte 
of input (four parity bits for each bus). 
When low, selects parity generation for whole 
32-bit input (one parity bit for each bus). 



CHEX 



C17 



I/O 



Status pin indicating an exception during a chained 
function. If 16 is low, indicates the multiplier 
is the source of the exception. If 16 is high, 
indicates the ALU is the source of the exception. 



CLK 



L3 



Master clock for all registers except C register 



CLKC 



M3 



C register clock 



CLKMODE N3 



Selects whether temporary register loads only on 
rising clock edge (CLKMODE = L) or on falling 
edge (CLKMODE = H). __ 



CONFIGO 
CONFIG1 



N1 
N2 



Select data sources for RA and RB registers from 
DA bus, DB bus and temporary register. - 



DAO 

DA1 

DA2 

DA3 

DA4 

DA5 

DA6 

DA7 

DA8 

DA9 

DA10 

DA1 1 

DA12 

DA13 

DA14 

DA15 

DA16 

DA 17 

DA18 

DAI 9 

DA20 

DA21 

DA22 

DA23 



P11 
T12 
S12 
R12 
P12 
T13 
S13 
R13 
P13 
T14 
S14 
R14 
P14 
T15 
S15 
S16 
R15 
S17 
N14 
P15 
R16 
R17 
P16 
P17 



DA 32-bit input data bus. Data can be latched in a 
64-bit temporary register or loaded directly into an 
input register. 
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Table 2. ACT8837 Pin Functional Description (Continued) 



PIN 



MAR AC 


NU. 


DA24 


N 1 5 


DA25 


N16 


DA26 


N17 


DA27 


M15 


DA28 


M16 


DA29 


■Ml 7 


DA30 


L15 


DA31 


L16 


DBO 


S3 


DB1 


P5 


DB2 


R4 


DB3 


T3 


DB4 


S4 


DB5 


P6 


DB6 


R5 


DB7 


T4 


DB8 


T5 


DB9 


P7 


DB10 


R6 


DB1 1 


S5 


DB12 


S6 


DB13 


T6 


DB14 


R7 


DB15 


S7 


DB16 


P8 


DB17 


T7 


DB18 


R8 


DB19 


S8 


DB20 


T8 


DB21 


P9 


DB22 


R9 


DB23 


S9 


DB24 


T9 


DB25 


T10 


DB26 


S10 


DB27 


R10 


DB28 


P10 


DB29 


T11 


DB30 


S1 1 


DB31 


R11 



I/O 



DESCRIPTION 



DA 32-bit input data bus. Data can be latched in a 
64-bit temporary register or loaded directly into an 
input register 



DB 32-bit input data bus. Data can be latched in a 
64-bit temporary register or loaded directly into an 
input register 



CO 

oo 
oo 
h- 
o 
< 

z 



DENIN 



C15 



I/O 



Status pin indicating a denormal input to the 
multiplier. When DENIN goes high, the STEX pins 
indicate which port had the denormal input. 
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Table 2. ACT8837 Pin Functional Description (Continued) 



PIN 
NAME 


NO. 


I/O 


DESCRIPTION 








Status pin indicating a denormal output from the 


DENORM 


B16 


I/O 


ALU or a wrapped output from the multiplier. In 
FAST mode, causes the result to go to zero when 
DENORM is high. 








When high, enables loading of RA register on a 


ENRA 


M2 


I 


rising clock edge if the RA register is not disabled 
(see PIPESO below). 








When high, enables loading of RB register on a 


ENRB 


M1 


I 


rising clock edge if the RB register is not disabled 
(see PIPESO below). 








When low, selects gradual underflow (IEEE mode). 


FAST 


E3 


I 


When high, selects sudden underflow, forcing all 
denormalized inputs and outputs to zero. 


GND 


D4 






GND 


D6 






GND 


D7 






GND 


D9 






GND 


D10 






GND 


D12 






GND 


D13 






GND 


E4 




Ground pins. NOTE: All ground pins should be 


GND 


E14 




used and connected. 


GND 


F4 






GND 


F14 






GND 


H4 






GND 


H14 






GND 


K4 






GND 


K14 






GND 


L14 






GND 


M4 






II A 1 T 

HAL 1 


DO 


1 

1 


Stalls operation without altering contents of 




instruction or data registers. Active low. 


10 


E2 






11 


DT 






12 


E1 






13 


F2 






14 


G3 


1 


Instruction inputs 


15 


F1 






16 


G2 






17 


G1 






18 


H3 






19 


H1 






INEX 


C14 


I/O 


Status pin indicating an inexact output 
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Table 2. ACT8837 Pin Functional Description (Continued) 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


IVAL A15 


I/O 


Status pin indicating that an invalid operation or a 
nonnumber (NaN) has been input to the multiplier 
or ALU. 


MSERR E17 


0 


Master/Slave error output pin 


A1 
A2 
A16 

A 1 "7 

AT / 
R1 

B17 

NC H2 
J15 
P1 
S1 
T1 
T16 
T17 




Mn intprnal pnnnpptinn Pine chni ilH hp Ipft f Inatinn 
I v kj iiiigiiigi irUi it i cu uui i . i iiio oi iuuiu uc ici i i lua n i ly • 


OEC G15 


I 


Comparison status output enable. Active low. 


OES F17 


I 


Exception status and other status output enable. 
Active low. 


OEY F16 


I 


Y bus output enable. Active low. 


OVER B14 

V-/ V LI 1 U 1 


I/O 


Status pin indicating that the result is greater the 

iarriPQt allnwahlp v/aImp for QnppifipH fnrmat 

lUl y V^O I OllKJ VV UUIC VulUC IUI OpCOl 1 1 W IVJMIiai 

(exponent overflow). 


PAO L17 
PA1 K15 
PA2 K16 
PA3 K17 


I 


Parity inputs for DA data 


PBO S2 
PB1 P4 
PB2 R3 
PB3 T2 


I 


Parity inputs for DB data 


PERRA F15 


0 


DA data parity error output. When high, signals a 
byte or word has failed an even parity check. 


PERRB C1 


0 


DB data parity error output. When high, signals a 
byte or word has failed an even parity check. 


PIPESO P2 


I 


When low, enables instruction register, RA and RB 
input registers. When high, puts instruction 
register, RA and RB registers in flowthrough mode. 


PIPES1 R1 


I 


When low, enables pipeline registers in ALU and 
multiplier. When high, puts pipeline registers in 
flowthrough mode.. 



CO 
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Table 2. ACT8837 Pin Functional Description (Continued) 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


PIPES2 N4 


I 


When low, enables status register, product (P) and 
sum (S) registers. When high, puts status register, 
P and S registers in flowthrough mode. 


PYO A13 
PY1 C12 
PY2 B13 
PY3 A14 


I/O 


Y port parity data 


RESET P3 


I 


Clears internal states and status with no effect to 
data registers. Active low. 


RNDO F3 
RND1 D2 


I 


Rounding mode control pins. Select four IEEE 
rounding modes (see Table 18). 


RNDCO B15 


I 


When high, indicates the mantissa of a wrapped 
number has been increased in magnitude by 
rounding. 


C C 1 IV A C /T"C 

okLiVlb/Lo bit) 


1 
1 


When low, selects LSH of 64-bit result to be 
output on the Y bus. When high, selects MSH of 
64-bit result. 


otLUrU Jo 
SEL0P1 J2 
SEL0P2 J1 

SEL0P4 K2 
SEL0P5 K3 
SEL0P6 L1 
SEL0P7 L2 


1 


Qole»r»t nnoranH cmirooc for miiltinlior and Al II 

(See Tables 6 and 7) 


SELSTO H17 
SELST1 H16 


1 


Select status source during chained operation 
(see Table 1 6) 


SRCC J16 


1 


When low, selects ALU as data source for C 
register. When high, selects multiplier as data 
source for C register. 


SRCEX C16 


I/O 


Status pin indicating source of status, either 
ALU (SRCEX = L) or multiplier (SRCEX = H) 


STEXO D16 
STfcXi Dip 


I/O 


Status pins indicating that a nonnumber (NaN) or 
denormal number has been input on A port 
(STEX1) or B port (STEXO). 


TPO H15 
TP1 G17 


1 


Test pins (see Table 19) 


UNDER C13 


I/O 


Status pin indicating that a result is inexact and 
less than minimum allowable value for format 
(exponent underflow). 


UNORD D17 


I/O 


Comparison status pin indicating that the two 
inputs are unordered because at least one of them 
is a nonnumber (NaN). 
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Table 2. ACT8837 Pin Functional Description (Concluded) 



PIN 



NAME 



NO. 



Vrr 


D5 


Vrr 


D8 


Vrr 


D1 1 


Vrr 


D14 


Vrr 


G4 


Vrr 


G14 


Vrr 


J4 


Vrr 


J14 


Vrr 


L4 


Vrr 


M14 


tu 


r o 


VI 
Y I 


UO 


V") 
YZ 


DO 


VQ 
Y O 


v*o 


v/i 
Y*f 


DO 


Y O 


A Q 


YD 


HA 


V7 
Y / 


RA 
D4 


VQ 
TO 


Ail 


VQ 

y y 


pc 


V 1 A 

Y I U 


DC 
DO 


III 


Ab 


Y 1 z 


re 


V10 

Y I o 


DO 


Y 14 


A C 

Ab 


Y i b 


L / 


V1C 

Y 1 O 


D"7 
D / 


Y 1 / 


A "7 

A / 


V 1 o 
Ylo 


to 


V1Q 

y i y 


DO 
DO 


YZU 


A8 


Y21 


A9 


Y22 


B9 


Y23 


C9 


Y24 


A10 


Y25 


B10 


Y26 


C10 


Y27 


A11 


Y28 


B11 


Y29 


A12 


Y30 


C11 


Y31 


B12 



I/O 



I/O 



DESCRIPTION 



5-V power supply 



32-bit Y output data bus 



CO 

oo 
oo 
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'ACT8837 Specification Tables 



absolute maximum ratings over operating free-air temperature range 
(unless otherwise noted) t 



Supply voltage, Vcc -0.5 V to 6 V 

Input clamp current, l|K (V| < 0 or V| > Vqc) • • • ±2 0 mA 

Output clamp current, Iok <0 or Vo > Vcc) ±50 mA 

Continuous output current, lo (Vo = 0 to Vcc) ±50 mA 

Continuous current through Vcc or GND pins ........ ± 100 mA 

Operating free-air temperature range 0°C to 70°C 

Storage temperature range - 65°C to 150°C 



t Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage 
to the device. These are stress ratings only and functional operation of the device at these or 
any other conditions beyond those indicated under "recommended operating conditions" is 
not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect 
device reliability. 



recommended operating conditions 



PARAMETER 


SN74ACT8837 


UNIT 


MIN 


NOM MAX 


vcc 


Supply voltage 


4.75 


5.0 5.25 


V 


V|H 


High-level input voltage 


2 




V 


V|L 


Low-level input voltage 


0 


, 0.8 


V 


lOH 


High-level output current 


-8 


mA 


>OL 


Low-level output current 


8 


mA 


V| 


Input voltage 


7f^ • ! 

P v cc 


V 


v 0 


Output voltage 


0 


v cc 


V 


dt/dv 


Input transition rise or fall rate 


0 


15 


ns/V 


T A 


Operating free-air temperature 


0 


70 


°C 
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electrical characteristics over recommended operating free-air 
temperature range (unless otherwise noted) 



PARAMETER 


TEST CONDITIONS 


v C c 


A 


OIV / HMU 1 OOO / 


UNIT 


MIN TYP MAX 

IVIIIv III ivifnyv 


MIN TYP MAY 

IVIIIM 1 lr IVIMyV 


VOH 


Iqh - 20 ijlA 


4.5 V 






V 


5.5 V 






Iqh - 8 mA 


4.5 V 




3.76 


5.5 V 




4.76 

— r^m 


vol 


Iql = 20 nA 


4.5 V 






V 


5.5 V 






Iql ~ 8 mA 


4.5 V 


^ — 


0.45 


5.5 V 




0.45 


»i 


V| = V C C or 0 


5.5 V 




±1 


tiA 


'cc 


V| = V C C or 0, l 0 


5.5 V 




200 


HA 




Vj = V C c or 0 


5 V 






PF 



switching characteristics (see Note) 



PARAMETER 


SN74ACT8837-65 
MIN MAX 


UNIT 




Propagation delay from DA/DB/I inputs 
to Y output 


125 


ns 


*pd2 


Propagation delay from input register to 
output buffer 


118 


ns 


tpd3 


Propagation delay from pipeline register to 
output buffer 


_ ■ 


ns 


*pd4 


Propagation delay from output register to 
output buffer 


30 


ns 


tpd5 


Propagation delay from SELMS/LS to Y output 


32 


ns 


tdi 


Propagation delay from input register to 
output register 


95 


ns 


td2 


Delay time, input register to pipeline register or 
pipeline register to output register 


65 


ns 



Note: Switching data must be used with timing diagrams for different operating modes. 
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setup and hold times 



PARAMETER 


5N74Av/loo37-oo 


UNIT 


MIN MAX 


t su -| Setup time, Instruction before CLKt 


18 


j& ns 


t S u2 Setup time, data operand before CLKt 





ns 


Setup time, data operand before second CLKt 
*su3 f° r double-precision operation (input register 
not enabled) 




ns 


t n i Hold time, Instruction input after CLKt 


0 


ns 


clock requirements 


PARAMETER 


SN74ACT8837-65 






t w Pulse duration 


CLK high 




ns 


CLK low 


-my-- 


Clock period 




ns 



z 

> 

o 



00 
00 

CO 
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SN74ACT8837 FLOATING POINT UNIT 

The SN74ACT8837 is a high-speed floating point unit implemented in Tl's 
advanced 1-/*m CMOS technology. The device is fully compatible with IEEE 
Standard 754-1985 for addition, subtraction and multiplication operations. 

The 'ACT8837 input buses cap be configured to operate as two 32-bit data buses 
or a single 64-bit bus, providing a number of system interface options. Registers are 
provided at the inputs, outputs, and inside the ALU and multiplier to support multilevel 
pipelining. These registers can be bypassed for nonpipelined operation. 

A clock mode control allows the temporary register to be clocked on the rising edge 
or the falling edge of the clock to support double precision operations (except 
multiplication) at the same rate as single precision operations. A feedback register with 
a separate clock is provided for temporary storage of a multiplier result, ALU result 
or constant. 

To ensure data integrity, parity checking is performed on input data, and parity is 
generated for output data. A master/slave comparator supports fault-tolerant system 
design. Two test pin control inputs allow all l/Os and outputs to be forced high, low, 
or placed in a high-impedance state to facilitate system testing. 

Floating point division using a Newton-Raphson algorithm can be performed in a sum- 
of -products operating mode, one of two modes in which the multiplier and ALU operate 
in parallel. Absolute value conversions, floating point to integer and integer to floating 
point conversions, and a compare instruction are also available. 

Data Flow p> 

oo 

Data enters the 'ACT8837 through two 32-bit input data buses, DA and DB. The buses CO 
can be configured to operate as a single 64-bit data bus for double precision operations ^ 
(see Table 7). Data can be latched in a 64-bit temporary register or loaded directly ^ 
into the RA and RB registers for input to the multiplier and ALU. ^" 

Four multiplexers select the multiplier and ALU operands from the input register, C Z 
register or previous multiplier or ALU result. Results are output on the 32-bit Y bus; 
a Y output multiplexer selects the most significant or least significant half of the result 
for output. The 64-bit C register is provided for temporary storage of a result from 
the ALU or multiplier. 

Input Data Parity Check 

When BYTEP is high, internal odd parity is generated for each byte of input data at 
the DA and DB ports and compared to the PA and PB parity inputs. If an odd number 
of bits is set high in a data byte, the parity bit for that byte is also set high. Parity 
bits are input on PA for DA data and PB for DB data. PAO and PBO are the parity bits 
for the least significant bytes of DA and DB, respectively. If the parity comparison 
fails for any byte, a high appears on the parity error output pin (PERRA for DA data 
and PERRB for DB data). 
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PARITY 
CHECK 



31-DAO DB31-DI 

TEMPORARY 
REGISTER 



PB 

i 



PARITY 
CHECK 



CONFIGURATION 
LOGIC 




> 
O 
H 
00 
00 
00 



SELMS/CS" 
FROM 

INSTRUCTION • • • 
REGISTER 



RA INPUT 
REGISTER 





MULTIPLIER CORE 



PIPELINE REGISTER 



ADDER/ROUNDER 



PRODUCT (P) REGISTER 



-*64 ,'64 




RB INPUT 
REGISTER 



35: 



3=i 





PIPELINE REGISTER 



NORMALIZER 



SUM (S) REGISTER 
_ 



STATUS REGISTER 



MASTER/ 

SLAVE 
COMPARE 



MSERR UNORO 
A GT B 
A EQ B 



- 19-10 
SELOP7 SELOPO 
PIPES2 PIPES 1 
FAST 

RND1-RND0 



—4- HALT 
— «~ BYTEP 
--4- CLK 
~4~ PIPESO 



4- CLKMODE 

RESET 

/ 2 4 TP1-TP0 
/ 10 4 VCC 
V 17 4 GNP 



- SELST1- 
SELSTO 



-4-6TS 
--4-015 



IVAL 

INEX 

OVER 

UNDER 

DENORM 

DENIN 

RNOCO 

SRCEX 

CHEX 

STEX1-STEX0 



Figure 1. 'ACT8837 Floating Point Unit 
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A parity check can also be performed on the entire input data word by setting BYTEP 
low. In this mode, PAO is the parity input for DA data and PBO is the parity input for 
DB data. 

Temporary Input Register 

A temporary input register is provided to enable double precision numbers on a single 
32-bit input bus to be loaded in one clock cycle. The contents of the DA bus are loaded 
into the upper 32 bits of the temporary register; the contents of DB are loaded into 
the lower 32 bits. A clock mode signal (CLKMODE) determines the clock edge on which 
the data will be stored in the temporary register. When CLKMODE is low, data is loaded 
on the rising edge of the clock; when CLKMODE is high, data is loaded on the falling 
edge. 

RA and RB Input Registers 

Two 64-bit registers, RA and RB, are provided to hold input data for the multiplier 
and ALU. Data is taken from the DA bus, DB bus and the temporary input register, 
according to configuration mode controls CONFIG1 -CONFIGO (see Tables 3 and 5). 
The registers are loaded on the rising edge of clock CLK. For single-precision operations, 
CONFIG 1 -CONFIGO should ordinarily be set to 0 1 (see Table 4). 



Table 3. Double-Precision Input Data Configuration Modes 



CONFIG 1 


CONFIGO 


LOADING SEQUENCE 


DATA LOADED INTO 
TEMP REGISTER ON FIRST 
CLOCK AND RA/RB 
REGISTERS ON SECOND 
CLOCK* 
DA DB 


DATA LOADED INTO 
RA/RB REGISTERS ON 
SECOND CLOCK 

DA DB 


0 


0 


B operand 


B operand 


A operand 


A operand 






(MSH) 


(LSH) 


(MSH) 


(LSH) 


0 


1 


A operand 


B operand 


A operand 


B operand 






(LSH) 


(LSH) 


(MSH) 


(MSH) 


1 


0 


A operand 


B operand 


A operand 


B operand 






(MSH) 


(MSH) 


(LSH) 


(LSH) 


1 


1 


A operand 


A operand 


B operand 


B operand . 






(MSH) 


(LSH) 


(MSH) 


(LSH) 



t On the first active clock edge (see CLKMODE, Table 1 7), data in this column is loaded into the temporary 
register. On the next rising edge, operands in the temporary register and the DA/DB buses are loaded into 
the RA and RB registers. 
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Table 4. Single-Precision Input Data Configuration Mode 



CONFIG 1 CONFIGO 


DATA LOADED INTO 
RA/RB REGISTERS ON 
FIRST CLOCK. 


NOTE 


DA 


DB 


0 1 


A operand 


B operand 


This mode is ordinarily 
used for single-precision 
operations. 


Table 5. Double-Precision Input Data Register Sources 


CONFIG 1 CONFIGO 


RA SOURCE 
MSH LSH 


RB SOURCE 
MSH LSH 


0 0 

0 1 

1 0 

1 1 


DA DB 

DA TEMP REG 
(MSH) 
TEMP REG DA 
(MSH) 

TEMP REG TEMP REG 
(MSH) (LSH) 


TEMP REG TEMP REG 
(MSH) (LSH) 
DB TEMP REG 
(LSH) 

TEMP REG DB 
(LSH) 
DA DB 



Multiplier/ALU Multiplexers 

Four multiplexers select the multiplier and ALU operands from the RA and RB registers, 
the previous multiplier or ALU result, or the C register. The multiplexers are controlled 
by input signals SELOP7-SELOP0 as shown in Tables 6 and 7. 



Table 6. Multiplier Input Selection 



A1 (MUX1) INPUT 


B1 (MUX2) INPUT 


SELOP7 SELOP6 


OPERAND SOURCE 


SELOP5 SEL0P4 


OPERAND SOURCE 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
ALU feedback 
RA input register 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
Multiplier feedback 
RB input register 



Table 7. ALU Input Selection 



A2 (MUX3) INPUT 


B2 (MUX4) INPUT 


SELOP3 SELOP2 


OPERAND SOURCE 


SELOP1 SELOPO 


OPERAND SOURCE 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
Multiplier feedback 
RA input register 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
ALU feedback 
RB input register 
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Pipelined ALU 

The pipelined ALU contains a circuit for addition and/or subtraction of aligned operands, 
a pipeline register, an exponent adjuster and a normalizer/rounder. An exception circuit 
is provided to detect denormal inputs; these can be flushed to zero if the fast input 
is set high. A denorm exception flag (DENORM) goes high when the ALU output is 
a denormal. 

The ALU may be operated independently or in parallel with the multiplier. Possible ALU 
functions during independent operation are given in Tables 8 and 9. Parallel 
ALU/multiplier functions are listed in Table 11. 

Pipelined Multiplier 

The pipelined multiplier performs a basic multiply function, A * B. The operands can 
be single-precision or double-precision numbers and can be converted to absolute values 
before multiplication takes place. Multiplier operations are summarized in Table 10. 

An exception circuit is provided to detect denormalized inputs; these are indicated 
by a high on the DENIN signal. 

The multiplier and ALU can be operated simultaneously by setting the 19 instruction 
input high. Possible operations in this chained mode are listed in Table 13. 

Product, Sum, and C Registers 

The results of the ALU and multiplier operations may optionally be latched into two 
output registers on the rising edge of the system clock (CLK). The P (product) register 
holds the result of the multiplier operation; the S (sum) register holds the ALU result. 

An additional 64-bit register is provided for temporary storage of the result of an ALU 
or multiplier operation before feedback to the multiplier or ALU. The data source for 
this C register is selected by SRCC; a high on this pin selects the multiplier result; 
a low selects the ALU. A separate clock, CLKC, has been provided for this register. 

Parity Generators 

Even parity is generated for the Y multiplexer output, either for each byte or for each 
word of output, depending on the setting of BYTEP. When BYTEP is high, the parity 
generator computes four parity bits, one for each byte of Y multiplexer output. Parity 
bits are output on the PY3-PY0 pins; PYO represents parity for the least significant 
byte. A single parity bit can also be generated for the entire output data word by setting 
BYTEP low. In this mode, PYO is the parity output. 
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Table 8. Independent ALU Operations, Single Operand (19 = 0, 16 = 0) 



VsrlMlnlE.U 


PRFPIQIOIM 


PRFPIQION 


01 itpi it 


nPFRAiun 

urcnHiNi/ 


ARQOI 1 ITF 


ALU OPERATION 


OPERATION 


DA 


RR 
rtD 


QOIJRPE 


TYPE 


VALUE A 






19 


18 


17 


16 


15 


14 


13-10 


RESULT 


0 = Not 


0 = A(SP) 


0 = B{SP) 


0 = ALU 


1 = Single 


0 - A 


0000 


Pass A operand 


Chained 


1 = A(DP) 


1 = B{DP) 


result 


Operand 


1 1 A 1 


0001 


Negate A operand 














0010 


Integer to floating point 
















conversion * 














0011 


Floating point to integer 
















conversion 














0100 


Undefined 














0101 


Undefined 














0110 


Floating point to floating 
















point conversion* 














0111 


Undefined 














1000 


Wrap (denormal) input 
















operand 














1001 


Undefined 














1010 


Undefined 














1011 


Undefined 














1100 


Unwrap exact number 














1101 


Unwrap inexact number 














1110 


Unwrap rounded input 














1111 


Undefined 



'The precision of the integer to floating point conversion is set by 18. 

*This converts single precision floating point to double precision floating point and vice versa. If the 18 pin is low to indicate a single-precision input, the result 
of the conversion will be double precision. If the 18 pin is high, indicating a double-precision input, the result of the conversion will be single precision. 



Table 9. Independent ALU Operations, Two Operands (19 = 0, 15 = 0) 



CHAINED 
OPERATION 
19 


PRECISION 
RA 
18 


PRECISION 
RB 
17 


OUTPUT 
SOURCE 
16 


OPERAND 
TYPE 

15 


ABSOLUTE 
VALUE A 

14 


ABSOLUTE 
VALUE B 
13 


ABSOLUTE 
VALUE Y 
12 


ALU OPERATION 


11-10 


RESULT 


0 .■= Not 


0 = A(SP) 


0 = B(SP) 


0 = ALU 


0 = Two 


0 = A 


0 = B 


0 - Y 


00 


A + B 


chained 


1 = A(DP) 


1 .= B(DP) 


result 


operands 


1 = |Aj. 


1 = |B| 


1 = |Y| 


01 


A - B 


















10 


Compare A, B 


















11 


B - A 



Table 10. Independent Multiplier Operations (19 - 0, 16 « 1) 



CHAINED 
OPERATION 
19 


PRECISION 
RA 
18 


PRECISION 
RB 
17 


OUTPUT 
SOURCE 
16 


15 


ABSOLUTE 
VALUE A 


ABSOLUTE 
VALUE B 
13* 


NEGATE 
RESULT 
I2t 


WRAP A 
11 


WRAP B 
10 


0 = Not 
chained 


0 = A(SP) 

1 = A{DP) 


0 = B(SP) 

1 = B(DP) 


1 = Multi- 
plier 
result 


0 


0 = A 

1 = iai; 


0 = B 

1 = |B| . 


0 = Y 

1 =,|Y| 


0 = Normal 
format 
1 = A is a 
wrapped 
number 


0 = Normal 
format 
1 = B is a 
wrapped 
number 



tSee Table 15. 
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Table 11. Independent Multiplier Operations Selected by I4-I2 (19 = 0, 16 = 1) 



ABSOLUTE 
VALUE A 
14 


ABSOLUTE 
VALUE B 
13 


NEGATE 
RESULT 
12 


OPERATION SELECTED 


14-12 


RESULTS 


0 = A 


0 = B 


0 = Y 


000 


A * B 


1 = |A| 


1 = |B| ■;. 


1 = -Y 


001 


-(A * B) 








010 


A * |B| 








01 1 


-(A * |B|) 








100 


|A| * B 








101 


~(|A| * B) 








110 


|A| *. |B| 








111 


-(|A| * |B|) 



Table 12. Operations Selected by 18-17 (19 = 0, 16 = 1) 



'(/> 
2 



PRECISION 
SELECT RA 
18 


PRECISION 
RA INPUT 


PRECISION 
SELECT RB 
17 


PRECISION 
RB INPUT 


PRECISION 
OF RESULT 


0 


Single 


0 


Single 


Single 




Single 








0 


Converted 


1 


Double 


Double 




to Double 














Single 




1 


Double 


0 


Converted 


Double 








to Double 




1 


Double 


1 


Double 


Double 



£ Master/Slave Comparator 

O A master/slave comparator is provided to compare data bytes from the Y output 
£q multiplexer and the status outputs with data bytes on the external Y and status ports 
00 when OEY # OES and OEC are high. If the data bytes are not equal, a high signal is 
generated on the master/slave error output pin (MSERR). 

Status and Exception Generator/Register 

A status and exception generator produces several output signals to indicate invalid 
operations as well as overflow, underflow, nonnumerical and inexact results, in 
conformance with IEEE Standard 754-1 985. If output registers are enabled 
(PIPES2 = 0), status and exception results are latched in a status register on the rising 
edge of the clock. Status results are valid at the same time that associated data results 
are v alid. Status outputs are enabled, by two signals, OEC for comparison status and 
OES for other status and exception outputs. Status outputs are summarized in 
Tables 14 and 1 5. 

During a compare operation in the ALU, the AEQB output goes high when the A and 
B operands are equal. When any operation other than a compare is performed, either 
by the ALU or the multiplier, the AEQB signal is used as a zero detect. 
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Table 13. Chained Multiplier/ALU Operations (19 = 1) 



CHAINED 


PRECISION 


PRECISION 


OUTPUT 




MULTIPLY 


NEGATE 


NEGATE MULTI- 




ALU 


OPERATION 


RA 


RB 


SOURCE 


ADD ZERO 


BY ONE 


ALU RESULT 


PLIER RESULT 


OPERATIONS 


19 


18 


17 


16 


15 


14 


13 


12 


11-10 


RESULT 


1 = Chained 


0 = A(SP) 


0 = B(SP) 


0 = ALU 


0 = Normal 


0 - Normal 


0 = Normal 


0 = Normal 


00 


A + B 




1 = A(DP) 


1 = B(DP) 


result 


operation 


operation 


operation 


operation 


or 


A - B 








1 = Multi- 


1 = Forces 


1 = Forces 


1 = Negate 


1 = Negate 


10 


2 - A 








plier 


B2 input 


B1 input 


ALU 


multiplier 


11 


B - A 








result 


of ALU 


of multi- 


result 


result 














to zero 


plier to 




















one 
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Table 14. Comparison Status Outputs 



SIGNAL 


RESULT OF COMPARISON (ACTIVE HIGH) 


AEQB 

AGTB 
UNORD 


The A and B operands are equal. (A high signal on the AEQB output indicates a 
zero result from the selected source except during a compare operation in the 
ALU.) 

The A operand is greater than the B operand. (Only during a compare operation 
in the ALU) 

The two inputs of a comparison operation are unordered, i.e., one or both of 
the inputs is a NaN. 



Table 15. Status Outputs 



SIGNAL 



STATUS RESULT 



CHEX 

DENIN 

DENORM 

I N EX 
IVAL 

OVER 
RNDCO 

SRCEX 

STEXO 
STEX1 
UNDER 



If 16 is low, indicates the multiplier is the source of an exception during a 
chained function. If 16 is high, indicates the ALU is the source of an exception 
during a chained function. 

Input to the multiplier is a denorm. When DENIN goes high, the STEX pins 
indicate which port had a denormal input. 

The multiplier output is a wrapped number or the ALU output is a denorm. In 
the FAST mode, this condition causes the result to go to zero. 

The result of an operation is not exact. 

A NaN has been input to the multiplier or the ALU, or an invalid operation 
(0 * oo or ± 00+ oo) has been requested. When IVAL goes high, the STEX 
pins indicate which port had a NaN. 

The result is greater than the largest allowable value for the specified format. 

The mantissa of a wrapped number has been increased in magnitude by 
rounding and the unwrap round instruction can be used to unwrap properly 
the wrapped number (see Table 8). 

The status was generated by the multiplier. (When SRCEX is low, the status 
was generated by the ALU.) 

A NaN or a denorm has been input on the B port. 

A NaN or a denorm has been input on the A port. 

The result is inexact and less than the minimum allowable value for the 
specified format. In the FAST mode, this condition causes the result to go to 
zero. 



5-36 



In chained mode, status results to be output are selected based on the state of the 
16 (source output) pin (if 16 is low, ALU status will be selected; if 16 is high, multiplier 
status will be selected). If the nonselected output source generates an exception, CHEX 
is set high. Status of the nonselected output source can be forced using the SELST 
pins, as shown in Table 16. 



Table 16. Status Output Selection (Chain Mode) 



SELST1- 
SELSTO 



STATUS SELECTED 



00 
01 
10 

11 



Invalid 

Selects multiplier status 
Selects ALU status 

Normal operation (selection based on result source specified by 16 input) 



Flowthrough Mode 

To enable the device to operate in pipelined or flowthrough modes, registers can be 
bypassed using pipeline control signals PIPES2-PIPES0 (see Table 17). 



Table 17. Pipeline Controls (PIPES2-PIPES0) 



PIPES2- 
PIPESO 


REGISTER OPERATION SELECTED 


X X 0 


Enables input registers (RA, RB) 


XX 1 


Disables input registers (RA, RB) 


X 0 X 


Enables pipeline registers 


X 1 X 


Disables pipeline registers 


0 X X 


Enables output registers (P, S, Status) 


1 X X 


Disables output registers (P, S, Status) 



FAST and IEEE Modes 

The device can be programmed to operate in FAST mode by asserting the FAST pin. 
In the FAST mode, all denormalized inputs and outputs are forced to zero. 

Placing a zero on the FAST pin causes the chip to operate in IEEE mode. In this mode, 
the ALU can operate on denormalized inputs and return denormals. If a denorm is input 
to the multiplier, the DENIN flag will be asserted, and the result will be invalid. If the 
multiplier result underflows, a wrapped number will be output. 



CO 

oo 
oo 
h- 
O 
< 

Z 
CO 
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Rounding Mode 

The 'ACT8837 supports the four IEEE standard rounding modes: round to nearest, 
round towards zero (truncate), round towards infinity (round up), and round towards 
minus infinity (round down). The rounding function is selected by control pins RND1 
and RNDO, as shown in Table 18. 



Table 18. Rounding Modes 



RND1- 


ROUNDING MODE SELECTED 


RNDO 




0 0 


Round towards nearest 


0 1 


Round towards zero (truncate) 


1 0 


Round towards infinity (round up) 


1 1 


Round towards negative infinity (round down) 



Test Pins 

Two pins, TP1.-TP0, support system testing. These may be used, for example, to place 
all outputs in a high-impedance state, isolating the chip from the rest of the system 
(see Table 1 9). 



CO 
Z 

> 

O 
H 
00 
00 
00 



TP1- 
TPO 



0 
0 

1 
1 



Table 19. Test Pin Control Inputs 



OPERATION 



All outputs and l/Os are forced low 

All outputs and l/Os are forced high 

All outputs are placed in a high impedance state 

Normal operation 



Summary of Control Inputs 

Control input signals for the 'ACT8837 are summarized in Table 20. 
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Table 20. Control Inputs 



SIGNAL 


HIGH 


LOW 


BYTEP 


Selects byte parity generation 
and test 


Selects single bit parity generation 
and test 


CLK 


Clocks all registers except C 


No effect 


CLKC 


Clocks C register 


No effect 


CLKMODE 


Enables temporary input register 
load on falling clock edge 


Enables temporary input register load 
on rising clock edge 


CONFIG1- 
CONFIGO 


See Table 3 (RA and RB register 
data source selects) 


See Table 3 (RA and RB register data 
source selects) 


ENRA 


If register is not in flow through, 
enables clocking RA register 


If register is not in flow through, holds 
contents of RA register 


ENRB 


If register is not in flow through, 
enables clocking of RB register 


If register is not in flow through, holds 
contents of RB register 


FAST 


Places device in FAST mode 


Places device in IEEE mode 


HALT 


No effect 


Stalls device operation but does not 
affect registers, internal states, or 
status 


OEC 


Disables compare pins 


Enables compare pins 


OES 


Disables status outputs 


Enables status outputs 


OEY 


Disables Y bus 


Enables Y bus 


PIPES2- 
PIPESO 


See Table 17 (pipeline mode 
control) 


See Table 17 (pipeline mode control) 


RESET 


No effect 


Clears internal states and status but 
does not affect data registers 


RND1- 
RNDO 


See Table 18 (rounding mode 
control) 


See Table 18 (rounding mode control) 


SEL0P7- 
SELOPO 


See Tables 6 and 7 
(multiplier/ALU operand selection) 


See Tables 6 and 7 (multiplier/ALU 
operand selection) 


SELMS/LS 


Selects MSH of 64-bit result for 
output on the Y bus 


Selects LSH of 64-bit result for output 
on the Y bus (no effect during single 
precision operation) 


SELST1- 
SELSTO 


See Table 1 5 (status output 
selection) 


See Table 1 5 (status output selection) 


SRCC 


Selects multiplier result for input 
to C register 


Selects ALU result for input to C 
register 


TP1-TP0 


See Table 1 9 (test pin control 
inputs) 


See Table 19 (test pin control inputs) 
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INSTRUCTION SET 



Configuration and operation of the 'ACT8837 can be selected to perform single- or 
double-precision floating-point calculations in operating modes ranging from 
flowthrough to fully pipelined. Timing and sequences of operations are affected by 
settings of clock mode, data and status registers, input data configurations/and 
rounding mode, as well as the instruction inputs controlling the ALU and the multiplier. 
The ALU and the multiplier of the 'ACT8837 can operate either independently or 
simultaneously, depending oh the setting of instruction inputs 19-10 and related controls. 

Controls for data flow and status results are discussed separately, prior to the 
discussions of ALU and multiplier operations. Then, in Tables 22 through 25, the 
instruction inputs to the ALU and the multiplier are summarized according to operating 
mode, whether independent or chained (ALU and multiplier in simultaneous operation). 

Loading External Data Operands 

Patterns of data input to the 'ACT8837 vary depending on the precision of the operands 
and whether they are being input as A or B operands. Loading of external data operands 
is controlled by the settings of CLKMODE and CONFIG 1-CONFIGO, which determine 
the clock timing and register destinations for data inputs. 

Configuration Controls (CONFIG 1 -CONFIGO) 

Ra Three input registers are provided to handle input of data operands, either single 
precision or double precision. The RA, RB, and temporary registers are each 64 bits 
wide. The temporary register is only used during input of double-precision operands. 

2 When single-precision or integer operands are loaded, the ordinary setting of CONFIG 1- 
^ CONFIGO is LH, as shown in Table 4. This setting loads each 32-bit operand in the 
> most significant half (MSH) of its respective register. The operands are loaded into 

the MSHs and adjusted to double precision because the data paths internal to the device 
Oo are all double precision. It is also possible to load single-precision operands with 
00 CONFIG 1-CONFIGO set to HH but two clock edges are required to load both the A 

and B operands on the DA bus. 

Double-precision operands are loaded by using the temporary register to store half 
of the operands prior to inputting the other half of the operands on the DA and DB 
buses. As shown in Tables 3 and 5, four configuration modes for selecting input sources 
are available for loading data operands into the RA and RB registers. 

CLKMODE Settings 

Timing of double-precision data inputs is determined by the clock mode setting, which 
allows the temporary register to be loaded on either the rising edge (CLKMODE = L) 
or the falling edge of the clock (CLKMODE = H). Since the temporary register is not 
used when single-precision operands are input, clock modes 0 and 1 are functionally 
equivalent for single-precision operations. 
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The setting of CLKMODE can be used to speed up the loading of double-precision 
operands. When the CLKMODE input is set high, data on the DA and DB buses are 
loaded on the falling edge of the clock into the MSH and LSH, respectively, of the 
temporary register. On the next rising edge, contents of the DA bus, DB bus, and 
temporary register are loaded into the RA and RB registers, and execution of the current 
instruction begins. The setting of CONFIG 1-CONFIGO determines the exact pattern 
in which operands are loaded, whether as MSH or LSH in RA or RB. 

Double-precision operation in clock mode 0 is similar except that the temporary register 
loads only on a rising edge. For this reason the RA and RB registers do not load until 
the next rising edge, when all operands are available and execution can begin. 

A considerable advantage in speed can be realized by performing double-precision ALU 
operations with CLKMODE set high. In this clock mode both double-precision operands 
can be loaded on successive clock edges, one falling and one rising, and the ALU 
operation can be executed in the time from one rising edge of the clock to the next 
rising edge. Both halves of a double-precision ALU result must be read out on the Y 
bus within one clock cycle when the 'ACT8837 is operated in clock mode 1. 

Internal Register Operations 

Six data registers in the 'ACT8837 are arranged in three levels along the data paths 
through the multiplier and the ALU. Each level of registers can be enabled or disabled 
independently of the other two levels by setting the appropriate PIPES2-PIPES0 inputs. H| 

The RA and RB registers receive data inputs from the temporary register and the DA 

and DB buses. Data operands are then multiplexed into the multiplier, ALU, or both. jn» 

To support simultaneous pipelined operations, the data paths through the multiplier J£ 

and the ALU are both provided with pipeline registers and output registers. The control g) 

settings for the pipeline and output registers (PIPES2-PIPES1 ) are registered with the h- 

instruction inputs I9-I0. ^ 

A seventh register, the constant (C) register is available for storing a 64-bit constant [5! 
or an intermediate result from the multiplier or the ALU. The C register has a separate 2 
clock input (CLKC) and input source select (SRCC). The SRCC input is not registered & 
with the instruction inputs. Depending on the operation selected and the settings of 
PIPES2-PIPES0, an offset of one or more cycles may be necessary to load the desired 
result into the C register. 

Status results are also registered whenever the output registers are enabled. Duration 
and availability of status results are affected by the same timing constraints that apply 
to data results on the Y output bus. 

Data Register Controls (PIPES2-PIPES0) 

Table 1 7 shows the settings of the registers controlled by PIPES2-PIPES0. Operating 
modes range from fully pipelined (PIPES2-PIPES0 = LLL) to flowthrough 
(PIPES2-PIPES0 = HHH). 
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In flowthrough mode all three levels of registers are disabled, a circumstance which 
may affect some double-precision operations. Since double-precision operands require 
two steps to input, at least half of the data must be clocked into the temporary register 
before the remaining data is placed on the DA and DB buses. 

When all registers (except the C register) are enabled/timing constraints can become 
critical for many double-precision operations. In clock mode 1 , the ALU can perform 
a double-precision operation and output a result during every clock cycle, and both 
halves of the result must be read out before the end of the next cycle. Status outputs 
are valid only for the period during which the Y output data is valid. 

Similarly, double-precision multiplication is affected by pipelining, clock mode, and 
sequence of operations. A double-precise multiply requires two cycles to execute, 
depending on the settings of PIPES2-PIPES0, The output may be valid for one or two 
cycles, depending on the precision of the next operation. 

Duration of valid outputs at the Y multiplexer depends on settings of PIPES2-PIPES0 
and CLKMODE, as well as whether all operations and operands are of the same type. 
For example, when a double-precision multiply is followed by a single-precision 
operation, one open clock cycle must intervene between the dissimilar operations. 

C Register Controls (SRCC, CLKC) 

wm The C register loads from the P or the S register output, depending on the setting of 
SRCC, the load source select. SRCC = H selects the multiplier as input source. 
™ Otherwise the ALU is selected when SRCC = L. In either case the C register only loads 
(/) the selected input on a rising edge of the CLKC signal. 
Z 

vj The C register does not load directly from an external data bus. One method for loading 
^ a constant without wasting a cycle is to input the value as an A operand during an 
O operation which uses only the ALU or multiplier and requires no external data inputs. 
H Since the B operand can be forced to zero in the ALU or to one in the multiplier, the 
go A operand can be passed to the C register either by adding zero or multiplying by one, 
CO then selecting the input source with SRCC and causing the CLKC signal to go high. 
^ Otherwise, the C register can be loaded through the ALU with the Pass A Operand 
instruction, which requires a separate cycle. 

Operand Selection (SELOP7-SELOP0) 

As shown in Tables 6 and 7, data operands can be selected as five possible sources, 
including external inputs from the RA and RB registers, feedback from the P and S 
registers, and a stored value in the C register. Contents of the C register may be selected 
as either the A or the B operand in the ALU, the multiplier, or both. When an external 
input is selected, the RA input always becomes the A operand, and the RB input is 
the B operand. 
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Feedback from the ALU can be selected as the A operand to the multiplier or as the 
B operand to the ALU. Similarly, multiplier feedback may be used as the A operand 
to the ALU or the B operand to the multiplier. 

Selection of operands also interacts with the selected operations in the ALU or the 
multiplier. ALU operations with one operand are performed only on the A operand. 
Also, depending on the instruction selected, the B operand may optionally be forced 
to zero in the ALU or to one in the multiplier. 

Rounding Controls (RND1-RND0) 

Because floating point operations may involve both inherent and procedural errors, 
it is important to select appropriate modes for handling rounding errors. To support 
the IEEE standard for binary floating-point arithmetic, the 'ACT8837 provides four 
rounding modes selected by RND1-RND0. 

Table 1 8 shows the four selectable rounding modes. The usual default rounding mode 
is round to nearest (RND1-RND0 = LL). In round-to-nearest mode, the 'ACT8837 
supports the IEEE standard by rounding to even (LSB = 0) when two nearest 
representable values are equally near. Directed rounding toward zero, infinity, or minus 
infinity are also available. 

Rounding mode should be selected to minimize procedural errors which may otherwise 
accumulate and affect the accuracy of results. Rounding to nearest introduces a 
procedural error not exceeding half of the least significant bit for each rounding 
operation. Since rounding to nearest may involve rounding either upward or downward 
in successive steps, rounding errors tend to cancel each other. 

In contrast, directed rounding modes may introduce errors approaching one bit for 
each rounding operation. Since successive rounding operations in a procedure may 
all be similarly directed, each introducing up to a one-bit error, rounding errors may 
accumulate rapidly, especially in single-precision operations. 

Status Exceptions 

Status exceptions can result from one or more error conditions such as overflow, 
underflow, operands in illegal formats, invalid operations, or rounding. Exceptions may 
be grouped into two classes: input exceptions resulting from invalid operations or 
denormal inputs to the multiplier, and output exceptions resulting from illegal formats, 
rounding errors, or both. 

To simplify the discussion of exception handling, it is useful to summarize the data 
formats for representing IEEE floating-point numbers which can be input to or output 
from the FPU (see Table 21 ). Since procedures for handling exceptions vary according 
to the requirements of specific applications, this discussion focuses on the conditions 
which cause particular status exceptions to be signalled by the FPU. 
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Table 21 . IEEE Floating-Point Representations 



TYPE OF 


EXPONENT (e) 


FRACTION (f) 


HIDDEN 


VALUE OF NUMBER REPRESENTED 


OPERAND 


SP(HEX) 


DP (HEX) 


(BINARY) 


BIT 


SP (DECIMAL) f 


DP (DECIMAL) t 


Normalized 
Number (max) 


FE 


7FE 


All 1 's 


1 


(-1)S (2127) (2-2-23) 


/ 1 \s 


/o1023\ to o - 52\ 


Normalized 
Number (min) 


01 


001 


Ail O's 


1 


(-1) s (2 - 1 26) (1) 


(-D s 


(2-1022) {1 ) 


Denormalized 
Number (max) 


00 


000 


All 1's 


0 


(1 -) s (2~ 12 6) (1 - 2-23) 


(-D s 


(2-1022) (1 -2-52) 


Denormalized 
Number (min) 


00 


000 


000.. .001 


0 


( -1,s (2-126) (2-23) 


(-D s 


(2-1022) (2- 52) 


Wrapped 
Number (max) 


00 


000 


All 1's 


1 


(-1)S (2-127) (2-2-23) 


(-1) s 


(2-1023) (2-2-52) 


Wrapped 
Number (min) 


EA 


7 CD 


AW O's 


1 


(-1)S (2-22+127) (1) 


(-D s 


(2-51 +1023) (1) 


Zero 


00 


000 


Zero 


0 


(-1) s (0.0) 


(-D s 


(0.0) 


Infinity 


FF 


7FF 


Zero 


1 


( - 1 ) s (infinity) 


<-D s 


(infinity) 


NAN (Not a 
Number 


FF 


7FF 


Nonzero 


N/A 


None 


None 





s = sign bit 



IEEE formats for floating-point operands, both single and double precision, consist of 
three fields: sign, exponent, and fraction, in that order. The leftmost (most significant) 
bit is the sign bit. The exponent field is eight bits long in single-precision operands 
and 1 1 bits long in double-precision operands. The fraction field is 23 bits in single 
precision and 52 bits in double precision. Further details of IEEE formats and exceptions 
are provided in the IEEE Standard for Binary Floating-Point Arithmetic, ANSI/IEEE 
Std 754-1985. 

Several status exceptions are generated by illegal data or instruction inputs to the FPU. 
Input exceptions may cause the following signals to be set high: IVAL, DENIN, and 
STEX1-STEX0. If the IVAL flag is set, either an invalid operation has been requested 
or a NaN (Not a Number) has been input. When DENIN is set, a denormalized number 
has been input to the multiplier. STEX1-STEX0 indicate which port (RA, RB, or both) 
is the source of the exception when either a denormal is input to the multiplier 
(DENIN = H) or a NaN (IVAL = H) is input to the multiplier or the ALU. 

NaN inputs are all treated as IEEE signaling NaNs, causing the IVAL flag to be set. 
When output from the FPU, the fraction field from a NaN is set high (all 1 's), regardless 
of the original fraction field of the input NaN. 

Output exception signals are provided to indicate both the source and type of the 
exception. DENORM, INEX, OVER, UNDER, and RNDCO indicate the exception type, 
and CHEX and SRCEX indicate the source of an exception. SRCEX indicates the source 
of a result as selected by instruction bit 16, and SRCEX is active whenever a result 
is output, not only when an exception is being signaled. The chained-mode exception 
signal CHEX indicates that an exception has be generated by the source not selected 
for output by 16. The exception type signaled by CHEX cannot be read unless status 
select controls SELST1 -SELSTO are be used to force status output from the deselected 
source. 

Output exceptions may be due either to a result in an illegal format or to a procedural 
error. Results too large or too small to be represented in the selected precision are 
signalled by OVER and UNDER. Any ALU output which has been increased in magnitude 
by rounding causes INEX to be set high. DENORM is set when the multiplier output 
is wrapped or the ALU output is denormalized. Wrapped outputs from the multiplier 
may be inexact or increased in magnitude by rounding, which may cause the INEX 
and RNDCO status signals to be set high. A denormal output from the ALU 
(DENORM = H) may also cause INEX to be set, in which case UNDER is also signalled. 

Handling of Denormalized Numbers (FAST) 

The FAST input selects the mode for handling denormalized inputs and outputs. When 
the FAST input is set low, the ALU accepts denormalized inputs but the multiplier 
generates an exception when a denormal is input. When FAST is set high, the DENIN 
status exception is disabled and all denormalized numbers, both inputs and results, 
are forced to zero. 
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A denormalized input has the form of a floating-point number with a zero exponent, 
a nonzero mantissa, and a zero in the leftmost bit of the mantissa (hidden or implicit 
bit). A denormalized number results from decrementing the biased exponent field to 
zero before normalization is complete. Since a denormalized number cannot be input 
to the multiplier, it must first be converted to a wrapped number by the ALU. When 
the mantissa of the denormal is normalized by shifting it left, the exponent field 
decrements from all zeros (wraps past zero) to a negative two's complement number 
(except in the case of .IXXX. . . , where the exponent is not decremented). 

Exponent underflow is possible during multiplication of small operands even when the 
operands are not wrapped numbers. Setting FAST = L selects gradual underflow so 
that denormal inputs can be wrapped and wrapped results are not automatically 
discarded. When FAST is set high, denormal inputs and wrapped results are forced 
to zero immediately. 

When the multiplier is in IEEE mode and produces a wrapped number as its result, 
the result may be passed to the ALU and unwrapped. If the wrapped number can be 
unwrapped to an exact denormal, it can be output without causing the underflow status 
flag (UNDER) to be set. UNDER goes high when a result is an inexact denormal, and 
a zero is output from the FPU if the wrapped result is too small to represent as a 
denormal (smaller than the minimum denorm). Table 22 describes the handling of 
wrapped multiplier results and the status flags that are set when wrapped numbers 
are output from the multiplier. 



Table 22. Handling Wrapped Multiplier Outputs 



TYPE 
OF RESULT 


STATUS FLAGS SET 
DENORM INEX RNDCO UNDER 


NOTES 


Wrapped, 
exact 
Wrapped, 
inexact 
Wrapped, 
increased in 
magnitude by 
rounding 


1 0 0 0 
1 1 0 1 

1 1 1 1 


Unwrap with 'Wrapped 
exact' ALU instruction 
Unwrap with 'Wrapped 
inexact' ALU instruction 

Unwrap with 'Wrapped 
rounded' ALU instruction 



When operating in chained mode, the multiplier may output a wrapped result to the 
ALU during the same clock cycle that the multiplier status is output. In such a case 
the ALU cannot unwrap the operand prior to using it, for example, when accumulating 
the results of previous multiplications. To avoid this situation, the FPU can be operated 
in FAST mode to simplify exception handling during chained operations. Otherwise, 
wrapped outputs from the multiplier may adversely affect the accuracy of the chained 
operation, because a wrapped number may appear to be a large normalized number 
instead of a very small denormalized number. 
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Because of the latency associated with interpreting the FPU status outputs and 
determining how to process the wrapped output, it is necessary that a wrapped operand 
be stored external to the FPU (for example/ in an external register file) and reloaded 
to the A port of the ALU for unwrapping and further processing. 



Data Output Controls (SELMS/LS, OEY) 

Selection and duration of results from the Y output multiplexer may be affected by 
several factors, including the operation selected, precision of the operands, registers 
enabled, and the next operation to be performed. The data output controls are not 
registered with the data and instruction inputs. When the device is microprogrammed, 
the effects of pipelining and sequencing of operations should be taken into account. 

Two particular conditions need to be considered. Depending on which registers are 
enabled, an offset of one or more cycles must be allowed before a valid result is available 
at the Y output multiplexer. Also, certain sequences of operations may require both 
halves of a double-precision result to be read out within a single clock cycle. This is 
done by toggling the SELMS/LS signal in the middle of the clock period. 

When a single-precision result is output, the SELMS/LS signal has no effect. The 
SELMS/LS signal is set low only to read out the LSH of a double-preci sion result. 
Whenever this signal is selecting a valid result for output on the Y bus, the OEY enable 
must be pulled low at the beginning of that clock cycle. 



Status Output Controls (SELST1-SELST0, OES, OEC) 

Ordinarily, SELST1 -SELSTO are set high so that status selection defaults to the output ^ 

source selected by instruction input 16. The ALU is selected as the output source when (*) 

16 is tow, and the multiplier when 16 is high. 5? 

00 

When the device operates in chained mode, it may be necessary to read the status q 

results not associated with the output source. As shown in Table 1 6, SELST1 -SELSTO ^ 
can be used to read the status of either the ALU or the multiplier regardless of the 

16 setting. 2* 



Status results are registered only when the output (P and S) registers are enabled 
(PIPES2 = L). Otherwise, the status register is transpa rent. In either case, status 
outputs can be read by pulling the output enables low (OES, OEC, or both). 



Stalling the Device (HALT) 



Operation of the ' ACT88 37 can be stalled nondestructiveiy by means of the HALT 
signal. Pulling the HALT input low causes the device to stall on the next low level 
of the clock. Register contents are unaltered when the devic e is stalled, and normal 
operat ion resumes at the next low clock period after the HALT signal is set high. Using 
HALT in microprograms can save power, especially using high clock frequencies and 
pipelined stages. 



if) 
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For some operations, such as a double-precision multiply with CLKMODE = 1, setting 
the HALT input low may interrupt loading of the RA, RB, and instruction registers, 
as well as stalling operation. In clock mode 1 , the temporary register loads on the falling 
edge of the clock, but the HALT signal going low would prevent the RA, RB, and 
instruction registers from loading on the next rising clock edge. It is th erefore necessary 
to have the instruction and data inputs on the pins when the HALT signal is set high 
again and normal operation resumes. 

Instruction Inputs (I9-I0) 

Three modes of operation can be selected with inputs I9-I0, including independent 
ALU operation, independent multiplier operation, or simultaneous (chained) operation 
of ALU and multiplier. Each operating mode is treated separately in the following 
sections. 

Independent ALU Operations 

The ALU executes single- and double-precision operations which can be divided 
according to the number of operands involved, one or two. The ALU accepts integer, 
normalized, and denormalized numbers as operands. Table 22 shows independent ALU 
operations with one operand, along with the inputs I9-I0 which select each operation. 
Conversions from one format to another are handled in this mode, with the exception 
of adjustments to precision during two-operand ALU operations. Wrapping and 
PI unwrapping of operands is also done in this mode. 

HI Table 24 presents independent ALU operations with two operands. When the operands 
m are different in precision, one single and the other double, the settings of the precision- 

2 selects I8-I7 will identify the single-precision operand so that it can automatically be 
reformatted to double precision before the selected operation is executed, and the 

^ result of the operation will be double precision. 

3 Independent Multiplier Operations 
00 

00 In this mode the multiplier operates on the RA and RB inputs which can be either single 
J*j precision, double precision, or mixed. Operands may be normalized or wrapped 
numbers, as indicated by the settings for instruction inputs 11-10. As shown in Table 25, 
the multiplier can be set to operate on the absolute value of either or both operands, 
and the result of any operation can be negated when it is output from the multiplier. 
Converting a single-precision denormal number to double precision does not normalize 
or wrap the denormal, so it is still an invalid input to the multiplier. 
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Table 23. Independent ALU Operations with One Operand 



ALU OPERATION 
ON A OPERAND 


INSTRUCTION 
INPUTS 19-10 


NOTES 


Pass A operand 


Ox 001 x 0000 




Negate A operand 
Convert from integer to 
floating point * 


0x001x0001 
Ox 0010 0010 




Convert from floating 
point to integer 
Undefined 
Undefined 

Convert from floating 
point to floating point 
(adjusts precision of 
input: SP-DP, DP-*SP) 


Ox 001 x 0011 
Ox 001 x 0100 
Ox 001 x 0101 
0x001x0110 


x = Don't care 

I.8 selects precision of A operand: 

0 = A (SP) 

1 = A (DP) 

14 selects absolute value of A operand: 
0 = A 


Undefined 


Ox 001 x 0111 


1 = | A | 


Wrap denormal operand 
Undefined 


Ox 001 x 1000 
Ox 001 x 1001 


During integer to floating point conversion, 
| A | is not allowed as a result. 


Undefined 


Ox 001 x 1010 




Undefined 


Ox 001x 1011 




Unwrap exact number 


Ox 001x 1100 




Unwrap inexact number 


0x001x1101 




Unwrap rounded input 


Ox 001x 1110 




Undefined 


Ox 001x 1111 





During this operation, 18 selects precision of the result. 00 

00 
H 

o 
< 

■ ■ ■ ■ ■ ■ ■ ■ z 
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Table 24. Independent ALU Operations with Two Operands 



Al 1 1 OPFRATIDIVIQ 




NOTES 




IMPI ITQ IQ m 


Add A + B 


Ox xOOO 0x00 




A -1 -1 1 A 1 i O 

Add |A| 4- B 


Ox x001 0x00 




Add A + | D l 


Ox xOOO 1x00 




Add lAl + IBI 


Oy v001 1 yOO 


v = Dnn't r*arp 


Suhtrart A — R 


Oy yOOO Oy01 




Subtract lAl — R 


Oy y001 0y01 

v A Avv 1 VAv 1 


0 = A (SP) 


Subtract A — 1 B 1 


Oy yOOO 1 y01 


1 = A (DPI 


Subtract 1 A ! — IB 


Ox x001 1x01 


17 <»p|prt^ nrpci<5inn r>f R onprand* 


Compare A, B 


Ox xOOO 0x10 


0 = B (SP) 


Compare |A|, B 


Ox x001 0x10 


1 = B (DP) 


Compare A, |B| 


Ox xOOO 1x10 


12 selects either Y or its absolute value: 


Compare |A|, |B| 


Ox x001 1x10 


0 = Y 


Subtract B - A 


0xx000 0x11 


1 = |Y| 


Subtract B — | A | 


Ox x001 0x11 




Subtract |B| - A 


Ox xOOO 1x11 




Subtract |B| - |A| 


Ox x001 1x11 





5 



Table 25. Independent Multiplier Operations 



MULTIPLIER OPRATION 
AND OPERANDS 


INSTRUCTION 
INPUTS 19-10 


NOTES 


Multiply A * B 
Multiply -(A * B) 
Multiply A * |'B| 
Multiply -(A.* |B|) 
Multiply | A | * B 
Multiply -(|A| * B) 
Multiply | A | * |B| 
Multiply -(|A| * |B|) 


Ox x100 OOxx 
Ox x1 00 01 xx 
Ox x100 10xx 
Ox x100 1 1xx 
Ox x101 OOxx 
0xx10101xx 
Ox x101 10xx 
Ox x101 1 1xx 


x = Don't Care 
18 selects A operand 
precision (0 = SP, 1 = DP) 
17 selects B operand 
precision (0 = SP, 1 = DP) 
11 selects A operand format 
(0 = Normal, 1 = Wrapped) 
10 selects B operand format 
(0 = Normal, 1 = Wrapped) 
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Chained Multiplier/ALU Operations 

In chained mode, the 'ACT8837 performs simultaneous operations in the multiplier 
and the ALU. Operations include addition, subtraction, and multiplication, except 
multiplication of wrapped operands. Several optional operations also increase the 
flexibility of the device. 

The B operand to the ALU can be set to zero so that the ALU passes the A operand 
unaltered. The B operand to the multiplier can be forced to the value 1 so that the 
A operand to the multiplier is passed unaltered (see Table 26). 



Table 26. Chained Multiplier/ALU Operations 



CHAINED OPERATIONS 


OUTPUT 
SOURCE 


INSTRUCTION 

mini ito m 1 1\ 

INPUTS 19-10 


NOTES 


Rill II TIDI ICD 

iVIUL 1 IrLltrf 


Al I I 

ALU 


A * D 


A i D 
A + D 


A 1 II 

ALU 


I X xuuu xxuu 




A * B 


A + B 


Multiplier 


1 x x 1 00 xxOO 




A * B 


A - B 


ALU 


1x xOOO xx01 




A * B 


A - B 


Multiplier 


1x x100 xx01 




A * B 


2 - A 


AMI 
MLU 


1 v vDnn w 1 n 

I X XUUU XX I u 


x = Don't Care 


A * B 


9 — A 


Multiplier 


1 X X 1 UU XX 1 u 


18 selects precision of 


A -* B 


B — A 


All 1 
ALU 


I X XUUU XX I I 


RA inputs: 


A * B 


B - A 


Multiplier 


1 v v 1 r\f\ vvi 1 

I X X I UU XX I I 


Ci — DA /QP\ 
U — nA (or) 


A * B 


A + 0 


ALU 


1x x010 xxOO 


1 = RA (DP 


A-.* B 


A + 0 


Multiplier 


1x x1 10 xxOO 


17 selects precision of 


A * B 


0 - A 


ALU 


1x x010 xx1 1 


RB inputs: 


A * B 


0 - A 


Multiplier 


1x x1 10 xx 1 1 


0 =■ RB (SP) 


A * 1 


A + B 


ALU 


1x x001 xxOO 


1 = RB (DP) 


A * 1 


A + B 


Multiplier 


1x x101 xxOO 


13 negates ALU result: 


A * 1 


A - B 


ALU 


1x x001 xx01 


0 - Normal 


A * 1 


A - B 


Multiplier 


1x x101 xx01 


1 = Negated 


A * 1 


2 - A 


ALU 


1x x001 xx10 


12 negates multiplier 


A * 1 


2 - A 


Multiplier 


1x x101 xx10 


result: 


A * 1 


B - A 


ALU 


1x x001 xx1 1 


0 = Normal 


A ■■* 1 


B - A 


Multiplier 


1x x101 xx1 1 


1 = Negated 


A *. 1 


A + 0 


ALU 


1x x01 1 xxOO 




A * 1 


A + 0 


Multiplier 


1x x1 1 1 xxOO 




A * 1 


0 - A 


ALU 


1x x01 1 xx1 1 




A * 1 


0 - A 


Multiplier 


1x x1 1 1 xx1 1 





5-51 



MICROPROGRAMMING THE 'ACT8837 

Because the 'ACT8837 is microprogrammable, it can be configured to operate on either 
single- or double-precision data operands, and the operations of the registers, ALU, 
and multiplier can be programmed to support a variety of applications. The following 
examples present not only control settings but the timings of the specific operations 
required to execute the sample instructions. 

Timing of the sample operations varies with the precision of the data operands and 
the settings of CLKMODE and PIPES. Microinstructions and timing waveforms are given 
for all combinations of data precision, clock mode, and register settings. Following 
the presentation of ALU and multiplier operations is a brief sum-of-products operation 
using instructions for chained operating mode. 

Single-Precision Operations 

Two single-precision operands can be loaded on the 32-bit input buses without use 
of the temporary register so CLKMODE has no effect on single-precision operation. 
Both the ALU and the multiplier execute all single-precision instructions in one clock 
cycle, assuming that the device is not operating in flowthrough mode (all registers 
disabled). Settings of the register controls PIPES2-PIPES0 determine minimum cycle 
time and the rate of data throughput, as evident from the examples below. 

Single-Precision ALU Operations 

Precision of each data operand is indicated by the setting of instruction input 18 for 
single-operand ALU instructions, or the settings of 18-17 for two-operand instructions. 
When the ALU receives mixed-precision operands (one operand in single precision and 
the other in double precision), the single-precision data input is converted to double 
and the operation is executed in double precision. 

If both operands are single precision, a single-precision result is output by the ALU. 
Operations on mixed-precision data inputs produce double-precision results. 

It is unnecessary to use the 'convert float-to-float' instruction to convert the single- 
precision operand prior to performing the desired operation on the mixed-precision 
operands. Setting 18 and 17 properly achieves the same effect without wasting an 
instruction cycle. 

Single-Precision Multiplier Operations 

Operand precision is selected by 18 and 17, as for ALU operations. The multiplier can 
multiply the A and B operands, either operand with the absolute value of the other, 
or the absolute values of both operands. The result can also be negated when it is 
output. If both operands are single precision, a single-precision result is output. 
Operations on mixed-precision data inputs produce double-precision results. 
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Sample Single-Precision Microinstructions 

The following four single-precision microinstruction coding examples show the four 
register settings, ranging from flowthrough to fully pipelined. Timing diagrams 
accompany the sample microinstructions. 

In the first example PIPES2-PIPES0 are all set high so the internal registers are all 
disabled. This microinstruction sets up a wrapped result from the multiplier to be 
unwrapped by the ALU as an exact denormalized number. In flowthrough mode the 
'unwrap exact' operation is performed without a clock as soon as the instruction is 
input. Single-precision timing in flowthrough mode is shown in Figure 2. 

CLKMODE = 0 PIPES = 111 Operation: Unwrap A operand exact 
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Figure 2. Single-Precision Operation, All Registers Disabled 
(PIPES « 111, CLKMODE « 0) 
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The second example shows a microinstruction causing the ALU to compare absolute 
values of A and B. Only the input registers are enabled (PIPES2-PIPES0 = 1 10) so 
the result is output in one clock cycle. 

CLKMODE = 0 PIPES = 110 Operation: Compare |A|,|B| 
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Figure 3. Single-Precision Operation, Input Registers Enabled 
(PIPES - 110/ CLKMODE - 0) 
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Input and output registers are enabled in the third example, which shows the subtraction 
B - A. Two clock cycles are required to load the operands, execute the subtraction, 
and output the result (see Figure 4). 

CLKMODE = 0 PIPES =010 Operation: Subtract B - A 
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Figure 4. Single-Precision Operation, Input and Output Registers Enabled 
(PIPES - 010, CLKMODE - 0) 
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The fourth example shows a multiplication A * B with all registers enabled. Three 
clock cycles are required to generate and output the product. Once the internal registers 
are all loaded with data or results, a result is available from the output register on every 
rising edge of the clock. The floating point unit produces its highest throughput when 
operated fully pipelined with single-precision operands. 

CLKMODE = 0 PIPES = 000 Operation: Multiply A * B 
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Figure 5. Single-Precision Operation, All Registers Enabled 
(PIPES = 000, CLKMODE « 0) 
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Double-Precision Operations 

Double-precision operations may be executed separately in the ALU or the multiplier, 
or simultaneously in both. Rates of execution and data throughput are affected by 
the settings of the register controls (PIPES2-PIPES0) and the clock mode (CLKMODE). 

The temporary register can be loaded on either the rising edge (CLKMODE = L) or 
the falling edge of the clock (CLKMODE = H). Double-precision operands are always 
loaded by using the 64-bit temporary register to store half of the operands prior to 
inputting the other half of the operands on the DA and DB buses. 

Input configuration is selected by CONFIG 1-CONFIGO, allowing several options for 
the sequence in which data operands are set up in the temporary register and the RA 
and RB registers. Operands are then sent to either the ALU or multiplier, or both, 
depending on the settings for SELOP 7-0. 

The ALU executes all double-precision operations in a single clock cycle. The multiplier 
requires two clock cycles to execute a double-precision operation. When the device 
operates in chained mode (simultaneous ALU and multiplier operations), the chained 
double-precision operation is executed in two clock cycles. The settings of 
PIPES2-PIPES0 determine whether the result is output without a clock (flowthrough) 
or after up to five clocks for a double-precision multiplication (all registers enabled 
and CLKMODE = L). 

HJ Double-Precision ALU Operations 

Eight examples are provided to illustrate microinstructions and timing for double- 
2 precision ALU operations. The settings of CLKMODE and PIPES2-P.IPES0 determine 
>j how the temporary register loads and which registers are enabled. Four examples are 
^ provided in each clock mode. 

^ Double-Precision ALU Operations with CLKMODE = 0 

00 The first example shows that, even in flowthrough mode, a clock signal is needed 
GO to load the temporary register with half the data operands (see Figure 6). The selected 
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operation is executed without a clock after the remaining half of the data operands 
are input on the RA and RB buses: 



CLKMODE = 0 PIPES =111 Operation: Add A + |B| 
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Figure 6. Double-Precision ALU Operation, All Registers Disabled 
(PIPES -111, CLKMODE « 0) 
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In the second example the input register is enabled (PIPES2-PIPES0 = 110). Operands 
A and B for the instruction, |B| - |A| , are loaded using CONFIG = 00 so that B is 
loaded first into the temporary register with MSH through the DA port and LSH through 
the DB port. On the second clock rising edge, the A operand is loaded in the same 
order directly to RA register while B is loaded from the temporary register to the RB 
register (see Figure 7). 

CLKMODE = 0 PIPES = 110 Operation: |B| - |A| 
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Both the input and output registers are enabled (PIPES2-PIPES0 010) in the third 
example. The instruction sets up the ALU to wrap a denormalized number on the DA 
input bus. The wrapped output can be fed back from the S register to the multiplier 
input multiplexer by a later microinstruction. Timing for this operation is shown in 
Figure 8. 

CLKMODE = 0 PIPES = 010 Operation: Wrap Denormal Input 
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In the fourth example with CLK MODE = L, all three levels of internal registers are 
enabled. The instruction converts a double-precision integer operand to a double- 
precision floating-point operand. Figure 9 shows the timing for this operating mode. 

CLKMODE = 0 PIPES = 000 Operation: Convert Integer to Floating Point 
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Figure 9. Double-Precision ALU Operation, All Registers Enabled 
(PIPES - 000, CLKMODE - 0) 
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Double-Precision ALU Operations with CLKMODE = 1 

The next four examples are similar to the first four except that CLKMODE = H so that 
the temporary register loads on the falling edge of the clock. When the ALU is operating 
independently, setting CLKMODE high enables loading of both double-precision 
operands on successive falling and rising clock edges. 

In this clock mode a double-precision ALU operation requires one clock cycle to load 
data inputs and execute, and both halves of the 64-bit result must be read out on 
the 32-bit Y bus within one clock cycle. The settings of PIPES2-PIPES0 determine 
the number of clock cycles which elapse between data input and result output. 

In the first example all registers are disabled (PIPES2-PIPES0 = 111), and the addition 
is performed in flowthrough mode. As shown in Figure 10, a falling clock edge is needed 
to load half of the operands into the temporary register prior to loading the RA and 
RB registers on the next rising clock. 

CLKMODE = 1 PIPES = 111 Operation: Add A + |B| 
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INSTRUCTION: FUNC(9,0), RND(1,0). FAST 
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Figure 10. Double-Precision ALU Operation, All Registers Disabled 
(PIPES - 111, CLKMODE - 1) 
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The second example executes subtraction of absolute values for both operands. Only 
the RA and RB registers are enabled (PIPES2-PIPES0 = 1 10). Timing is shown in 
Figure 11. 

CLKMODE = 1 PIPES = 110 Operation: Subtract J B| - |A| 
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Figure 1 1 . Double-Precision ALU Operation, Input Registers Enabled 
(PIPES - 110, CLKMODE - 1) 
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The third example shows a single denormalized operand being wrapped so that it can 
be input to the multiplier. Both input and output registers are enabled 
(PIPES2-PIPES0 = 010). Timing is shown in Figure 12. 

CLKMODE = 1 PIPES = 010 Operation: Wrap Denormal Input 
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Figure 1 2. Double-Precision ALU Operation, Input and Output Registers Enabled 
(PIPES - 010, CLKMODE _ 1) 



The fourth example shows a conversion from integer to floating point format. All three 
levels of data registers are enabled (PIPES2-PIPES0) so that the FPU is fully pipelined 
in this mode (see Figure 13). 

CLKMODE = 1 PIPES = 000 Operation: Convert Integer to Floating Point 
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Figure 13. Double-Precision ALU Operation, Alt Registers Enabled 
(PIPES - 000, CLKMODE - 1) 



Double-Precision Multiplier Operations 

Independent multiplier operations may also be performed in either clock mode and with 
various registers enabled. As before, examples for the two clock modes are treated 
separately. A double-precision multiply operation requires two clock cycles to execute 
(except in flowthrough mode) and from one to three other clock cycles to load the 
temporary register and to output the results, depending on the setting of 
PIPES2-PIPES0. 

Even in flowthrough mode (PIPES2-PIPES0 =111) two clock edges are required, the 
first to load half of the operands in the temporary register and the second to load the 
intermediate product in the multiplier pipeline register. Depending on the setting of 
CLKMODE, loading the temporary register may be done on either a rising or a falling 
edge. 

Double-Precision Multiplication with CLKMODE = 0 

In this first example, the A operand is multiplied by the absolute value of B operand. 
Timing for the operation is shown in Figure 14: 

CLKMODE = 0 PIPES = 111 Operation: Multiply A *|B| 
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Figure 14. Double-Precision Multiplier Operation, All Registers Disabled 
(PIPES - 111, CLKMODE - 0) 



The second example assumes that the RA and RB input registers are enabled. With 
CLKMODE = 0 one clock cycle is required to input both the double-precision operands. 
The multiplier is set up to calculate the negative product of | A| and B operands: 

CLKMODE - 0 PIPES =110 Operation: Multiply -(|A| * B) 
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Figure 15. Double-Precision Multiplier Operation, Input Registers Enabled 
(PIPES = 1 10, CLKMODE = 0) 
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Enabling both input and output registers in the third example adds an additional delay 
of one clock cycle, as can be seen from Figure 16. The sample instruction sets up 
calculation of the product of |A| and |B|: 

CLKMODE = 0 PIPES = 010 Operation: Multiply |A| * |B| 
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Figure 16. Double-Precision Multiplier Operation, Input and Output Registers Enabled 
(PIPES ■'- 010, CLKMODE =» 0) 
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With all registers enabled, the fourth example shows a microinstruction to calculate 
the negated product of operands A and B: 

CLKMODE = 0 PIPES = 000 Operation: Multiply -(A * B) 
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Figure 17. Double-Precision Multiplier Operation, All Registers Enabled 
(PIPES - 000, CLKMODE « 0) 



Double-Precision Multiplication with CLKMODE = 1 

Setting the CLKMODE control high causes the temporary register to load on the failing 
edge of the clock. This permits loading both double-precision operands within the same 
clock cycle. The time available to output the result is also affected by the settings 
of CLKMODE and PIPES2-PIPES0, as shown in the individual timing waveforms. 

The first multiplication example with CLKMODE set high shows a multiplication in 
flowthrough mode (PIPES2-PIPES0 = 111). Figure 18 shows the timing for this 
operating mode: 

CLKMODE = 1 PIPES = 111 Operation: Multiply A * |B| 
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Figure 18. Double-Precision Multiplier Operation, All Registers Disabled 
& (PIPES - 111, CLKMODE - 1) 
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Figure 19. Double-Precision Multiplier Operation, Input Registers Enabled 
(PIPES - 110, CLKMODE - 1) 
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With both input and output registers pipelined, the third example calculates the product 
of | A | and | B | . Enabling the output register introduces a one-cycle delay in outputting 
the result (see Figure 20): 

CLKMODE = 1 PIPES = 010 Operation: Multiply. |A| * |B| 
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Figure 20. Double-Precision Multiplier Operation, Input and Output Registers Enabled 
(PIPES « 010, CLKMODE = 1) 
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The fourth example shows the instruction and timing (Figure 21) to generate the 
negated product of the A and B operands. This operating mode with CLKMODE set 
high and all registers enabled permits use of the shortest clock period and produces 
the most data throughput, assuming that this is the primary operating mode in which 
the device is to function. 

Additional considerations affecting timing and throughput are discussed in the section 
on mixed operations and operands. 

CLKMODE = 1 PIPES = 000 Operation: Multiply -(A * B) 
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Figure 21. Double-Precision Multiplier Operation, All Registers Enabled 
(PIPES = 000, CLKMODE =1) 
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Chained Multiplier/ALU Operations 



Simultaneous multiplier and ALU functions can be selected in chained mode to support 
calculation of sums of products or products of sums. Operations selectable in chained 
mode (see Table 25) overlap partially with those selectable in independent multiplier 
or ALU operating mode. Format conversions, absolute values, and wrapping or 
unwrapping of denormal numbers are not available in chained mode. 

To calculate sums of products, the FPU can operate on external data inputs in the 
multiplier while the ALU operates on feedback from the previous calculation. The 
operand selects SELOPS7-SELOPS0 can be set to select multiplier inputs from the 
RA and RB registers and ALU inputs from the P and S registers. 

This mode of chained multiplier and ALU operation is used repeatedly in the division 
and square root calculations presented later. The sample microinstruction sequence 
shown in Tables 27 and 28 performs the operations for multiplying sets of data 
operands and accumulating the results, the basic operations involved in computing 
a sum of products. 

Table 27 represents the operations, clock cycles, and register contents for a single- 
precision sum of four products. Registers used include the RA and RB input registers 
and the product (P) and sum (S) registers. 



Table 27. Single-Precision Sum of Products (PIPES2-PIPES0 « 010) 
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G - RA, H - RB 




G * H 


E * F- P(EF) 


5 


S(AB + CD) +EF) + P(GH) 


S(AB + CD) + P(EF) - S(AB + CD + EF) 
G * H-P(GH) 


6 


New Instruction 


S(AB + CD + EF) + P(GH) - S(AB + CD + EF + GH) 



A microcode sequence to generate this sum of product is shown in Table 28. Only 
three instructions in chained mode are required, since the multiplier begins the 
calculation independently and the ALU completes it independently. 
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Table 28. Sample Microinstructions for Single-Precision Sum of Products 



€ 
L 
K 
M 
0 
D 
E 



00 PP 
N N II 
FF PP 
I I EE 
GG SS 
1-0 2-0 



C C 



SS 
E E 
L L 
0 0 
P P 
7-0 



S 
E 
L 
M 



S S 
E E 
L L 
S S 
TT 
1-0 



I I 

9-0 



R R 
N N 
D D 

1-0 



S B 

F EE S / Y 

ANNR_OOOT 
SRRCLEEE E 
TABCSYCSP 



S A TT 
EL PP 
T T 1-0 



R _ 
E H 



00 0100 0000 


0 


01 


010 


1111 


xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 1 


11 


10 0110 0000 


0 


01 


010 


1111 


xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 1 


11 


10 0000 0000 


0 


01 


010 


1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 1 


11 


10 0000 0000 


0 


01 


010 


XXXX 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 1 


11 


00 0000 0000 


0 


01 


010 


xxxx 


1010 


00 


0 


X 


X 


X 


X 


X 


X 


X 


X 


XX 


1 1 


11 


XX xxxx xxxx 


X 


XX 


XXX 


xxxx 


xxxx 


XX 


X 


X 


X 


X 


X 


0 


0 


0 


X 


XX 


1 1 


11 



Fully Pipelined Double-Precision Operations 

Performing fully pipelined double-precision operations requires a detailed understanding 
of timing constraints imposed by the multiplier. In particular, sum of products and 
product of sums operations can be executed very quickly, mostly in chained mode, Mj 
assuming that timing relationships between the ALU and the multiplier are coded 13 
properly. ^ m 

Pseudocode tables for these sequences are provided, (Table 29 and Table 30) showing CO 

how data and instructions are input in relation to the system clock. The overall patterns ^ 

of calculations for an extended sum of products and an extended product of sums j— 

are presented. These examples assume FPU operation in CLKMODE 0, with the CONFIG O 

setting HL to load operands by MSH and LSH, all registers enabled ^ 
(PIPES2 - PIPESO = LLL), and the C register clock tied to the system clock. 

In the sum of products timing table, the two initial products are generated in C/) 
independent multiplier mode. Several timing relationships should be noted in the table. 
The first chained instruction loads and begins to execute following the sixth rising 
edge of the clock, after the first product P1 has already been held in the P register 
for one clock. For this reason, P1 is loaded into the C register so that P1 will be stable 
for two clocks. 

On the seventh clock, the ALU pipeline register loads with an unwanted sum, P1 -f P1 . 
However, because the ALU timing is constrained by the multiplier, the S register will 
not load until the rising edge of CLK9, when the ALU pipe contains the desired sum, 
P1 + P2. The remaining sequence of chained operations then execute in the desired 
manner. 
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Table 29. Pseudocode for Fully Pipelined Double-Precision Sum of Products 
(CLKM = 0, CONFIG = 1 0, PIPES = 000, CLKC— *SYSCLK) 



CLK 


DA 
BUS 


DB 
BUS 


TEMP 
REG 


INS 
BUS 


INS 
REG 


RA 
REG 


RB 
REG 


MUL 
PIPE 


P 
REG 


C 
REG 


ALU 
PIPE 


S 
REG 


Y 
BUS 


_Ti 


AT MSH 


B1 MSH 


A1,B1MSH 


A1 *B1 




















J~2 


A1 LSH 


B1 LSH 


A1,B1MSH 


A1 *B1 


A1 *B1 


A1 


B1 














J"3 


A2 MSH 


B2 MSH 


A2,B2MSH 


A2*B2 


A1 *B1 


A1 


B1 


A1 *B1 












i ^ 


A2 LSH 


B2 LSH 


A2,B2MSH 


A2 * B2 


A2 * B2 


A2 


B2 


A1 *B1 














A3 MSH 


B3 MSH 


A3,B3MSH 


PR + CR 
A3 * B3 


A2*B2 


A2 


B2 


A2*B2 


P1 












A3 LSH 


B3 LSH 


A3,B3MSH 


PR + CR 
A3 * B3 


PR + CR, 
A3 * B3 


A3 


B3 


A2*B2 


P1 


P1 








_r 7 


A4 MSH 


B4 MSH 


A4,B4MSH 


PR + SR 
A4 * B4 


PR + SR, 
A3 *"B3 


A3 


B3 


A3*B3 


P2 


P1 


■PI +P1 








A4 LSH 


B4 LSH 


A4,B4MSH 


PR + SR 
A4 * B4 


PR + SR, 
A4 * B4 


A4 


B4 


A3*B3 


P2 


P1 


P1 +P1 








A5 MSH 


B5 MSH 


A5,B5MSH 


DD _l CD 

rn + on 
A5*B5 


DD _i_ CD 

rn + on, 
A4*B4 


A4 


B4 


A4*B4 


P3 


P2 


S1 +P2 


S1 




J-10 


A5 LSH 


B5 LSH 


A5,B5MSH 


PR + SR 
A5*B5 


PR + SR, 
A5*B5 


A5 


B5 


A4*B4 


P3 


P3 


S1 +P3 


S1 




'J""-' 


A6 MSH 


B6 MSH 


A6,B6(M) 


PR + SR 
A6* B6 


PR + SR, 
A5*B5 


A5 


B5 


A5*B5 


P4 


P3 


XXXXX 


S2 




J~12 






























Table 30. Pseudocode for Fully Pipelined Double-Precision Product of Sums 
(CLKM«0, CONFIG - 10, PIPES = 000, CLKC^SYSCLK) 



CLK 


DA 

Rl IQ 
DUO 


DB 

Rl IQ 
DUO 


TEMP 
per* 


INS 

Rl IQ 
DUO 


INS 
ncvs 


RA 
ncvi 


RB 
nco 


MUL 

pipe 
fire 


P 


c 


ALU 
Pipe 


s 

ncvi 


Y 

Rl IQ 
DUO 


-J 1 


A 1 /M\ 




A 1 Rl IKA\ 
M 1 , D 1 \ 1 VI; 


A 1 4- R 1 




















J"2 


A1(U 


B1(L) 


A1,B1(M) 


A1 + B1 


A1 + B1 


A1 


B1 














I ^ 

_r 3 


A2(M) 


B2(M) 


A2,B2(M) 


A2 + B2 


A1 +B1 


A1 


B1 








A1+B1 








A2(L) 


B2(L) 


A2,B2(M) 


A2 + B2 


A2 + B2 


A2 


B2 








A1 +B1 


S1 






A3{M) 


B3(M) 


A3 f B3(M) 


CR*SR 
A3 + B3 


A2 + B2 


A2 


B2 






S1 


A2 + B2 


S1 




_r 6 


A3(L) 


B3(L) 


A3,B3(M) 


CR*SR 
A3 + B3 


CR*SR 
A3 + B3 


A3 


B3 






S1 


A2 + B2 


S2 




_r 7 


XXX 


XXX 


XXX 


SP Add 


CR*SR 
A3 + B3 


A3 


B3 


S1 *S2 




S1 


A3 + B3 


S2 




_T 8 


A4(M) 


B4(M) 


A4,B4(M) 


PR*SR 
A4 + B4 


CR*SR 
A3 + B3 


ENRA = L 
A3 


ENRB=L 
B3 


S1 *S2 




S1 


A3 + B3 


XXX 




_r ^ 


A4(L) 


B4(L) 


A4,B4(M) 


PR*SR 
A4 + B4 


PR*SR 
A4 + B4 


A4 


B4 


XXX 


P1 


S1 


XXX 


S3 




J-io 


XXX 


XXX 


XXX 


SP Add 


PR*SR 
A4 + B4 


A4 


B4 


P1 *S3 


P1 


S1 


A4 + B4 


S3 






A5(M) 


B5(M) 


A5,B5(M) 


PR*SR 
A5 + B5 


PR*SR 
A4 + B4 


ENRA = L 
A4 


ENRB = L 
B4 


P1 *S3 


XXX 


S1 


A4 + B4 


XXX 






A5(L) 


B5(L) 


A5,B5(M) 


PR*SR 
A5 + B5 


PR*SR 
A5 + B5 


A5 


B5 


XXX 


P2 


S1 


XXX 


S4 





NOTE: On CLK 7 and CLK 10, put 0000000000 (Single-Precision Add) on the instruction bus. 
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In the product of sums timing table, the two initial sums are generated in independent 
ALU mode. The remaining operations are shown as alternating chained operations 
followed by single-precision adds. The SP adds are necessary to provide an extra cycle 
during which the multiplier outputs the current intermediate product/ The current sum 
and the latest intermediate product are then fed back to the multiplier inputs for the 
next chained operations. In this manner, a double-precision product of sums is 
generated in three system clocks, as opposed to two clocks for a double-precision 
sum of products. 

Mixed Operations and Operands 

Using mixed-precision data operands or performing sequences of mixed operations 
may require adjustments in timing, operand precision, and control settings. To simplify 
microcoding sequences involving mixed operations, mixed-precision operands, or both, 
it is useful to understand several specific requirements for mixed-mode or mixed- 
precision processing. 

Calculations involving mixed-precision operands must be performed as double-precision 
operations (see Table 12). The instruction settings (I8-I7) should be set to indicate 
the precision of each operand from the RA and RB input registers. (Feedback operands 
from internal registers are also double-precision.) Mixed-precision operations should 
not be performed in chained mode. 

Timing for operations with mixed-precision operands is the same as for a corresponding 
double-precision operation. In a mixed-precision operation, the single-precision operand 
must be loaded into the upper half of its input register. 



■jjj Most format conversions also involve double-precision timing. Conversions between 

4> single- and double-precision floating point format are treated as mixed-precision 

J* operations. During integer to floating point conversions, the integer input should be 

Ijj loaded into the upper half of the RA register. 
00 

00 In applications where mixed-precision operations is not required, it is possible to tie 

J*j the I8-I7 instruction inputs together so that both controls always select the same 
precision. 
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Sequences of mixed operations may require changes in multiple control settings to 
deal with changes in timing of input, execution, and output of results. Figure 22 shows 
a simplified timing waveform for a series of mixed operations: 



CLOCK CYCLE 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


FUNCTION 
AND DATA 


A,B 


A,B 


CD 


CD 




E,F 


G,H 


G,H 


U 


U 


K.L 


M,N 




RESULTS 
AND STATUS 






xxxx 


A,B 


xxxx 


CD 


E.F 


E.F 


G,H 


G.H 


I, J 


K.L 


M,N 



A,B,C,D — double precision multiply; E,F ■— single precision operation; G,H,I,J — double 
precision add; K,L — single precision opration. A double precision number is not required to 
be held on the outputs for two cycles unless it is followed by a like double precision function. 
If a double precision multiply is followed by single precision operation, there must be one open 
clock cycle. 

Figure 22. Mixed Operations and Operands 
(PIPES2-PIPES0 - 110, CLKMODE - 0) 



In this sequence, the fifth cycle is left open because a single-precision multiply follows 
a double-precision multiply. If the SP multiply were input during the period following 
the fourth rising clock edge, the result of the preceding operation would be overwritten, 
since an SP multiply executes in one clock cycle. To avoid such a condition, the FPU 
will not load during the required open cycle. 

CO 

Because the sequence of mixed operations places constraints on output timing, only 
one cycle is available to output the double-precision (C * D) result. By contrast, the h- 
SP multiply (E * F) is available for two cycles because the operation which follows ^ 
it does not output a result in the period following the seventh rising clock edge. In ^ 
general, the precision and timing of each operation affects the timing of adjacent r* 
operations. 
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Control settings for CLKMODE and registers must also be considered in relation to 
precision and speed of execution. In Figure 23, a similar sequence of mixed operations 
is set up for execution in fully pipelined mode: 



CLOCK CYCLE 




FUNCTION 
AND DATA 

RESULTS 
AND STATUS 



A,B 




CD 




E,F 


G,H 


U 


K,L 


M,N 


0,P 




Q,R 












A,B 


A,B 


CD 


E,F 


G,H 


U 


K,L 


M,N 


M,N 



A,B,C,D — double precision multiply; E,F — single precision operation; G,H, — double precision 
add; l,J,K,L,M,N — single precision operation; 0,P,Q,R — double precision multiply. In clock 
mode 1 , a double precision result is two cycles long only when a double precision multiply is 
followed by a double precision multiply. 



Figure 23. Mixed Operations and Operands 
(PIPES2-PIPES0 - 000, CLKMODE - 1) 



Although the data operands can be loaded in one clock cycle with CLKMODE set high, 
enabling two additional internal registers delays the (A * B) result one cycle beyond 
the previous example. Again, an open cycle is required after the (C * D) operation 
because the next operation is single precision. The result of the (C * D) multiply is 
(/) available for one cycle instead of two, also because the following operation is single 
2 precision. With this setting of CLKMODE and PIPES2-PIPES0, a double-precision result 
is only available for two clock cycles when one DP multiply follows another DP multiply. 

> 

£j Matrix Operations 

The 'ACT8837 floating point unit can also be used to perform matrix manipulations 
CO involved in graphics processing or digital signal processing. The FPU multiplies and 
^ adds data elements, executing sequences of microprogrammed calculations to form 

new matrices. 

Representation of Variables 

In state representations of control systems, an n-th order linear differential equation 
with constant coefficients can be represented as a sequence of n first-order linear 
differential equations expressed in terms of state variables: 

dx1 n dx(n- 1) 

—:— = x 2, . . . , — , = xn 
dt dt 



5-92 



For example, in vector-matrix form the equations of an nth-order system can be 
represented as follows: 





x1 






x2 




d 






dt 








xn 





a11 


a12 . . 


a1n 


an1 


an2 . . 


. . ann 





x1 






x2 


+ 




xn 





b11 


. . bin 


bn1 


. . bnn 



un 



or, X = ax + bu 

Expanding the matrix equation for one state variable, dx1/dt, results in the following 
expression: 



X1 = (a1 1 * x1 + . . . + a1n .* xn) + (b11. * u1 + . . . + bin .* un) 
where X1 = dx1/dt. 

Sequences of multiplications and additions are required when such state space 
transformations are performed, and the 'ACT8837 has been designed to support such 
sum-of-products operations. An n x n matrix A multiplied by an n x n matrix X yields 
an n x n matrix C whose elements cij are given by this equation: 



cij = aik * xkj fori = 1, 
k=1 



,n j = 1, 



,n 



(1) 



For the cij elements to be calculated by the 'ACT8837, the corresponding elements 
aik and xkj must be stored outside the 'ACT8837 and fed to the 'ACT8837 in the 
proper order required to effect a matrix multiplication such as the state space system 
representation just discussed. 

Sample Matrix Transformation 

The matrix manipulations commonly performed in graphics systems can be regarded 
as geometrical transformations of graphic objects. A matrix operation on another matrix 
representing a graphic object may result in scaling, rotating, transforming, distorting, 
or generating a perspective view of the image. By performing a matrix operation on 
the position vectors which define the vertices of an image surface, the shape and 
position of the surface can be manipulated. 



oo 
oo 
oo 

O 
< 

z 
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The generalized 4x4 matrix for transforming a three-dimensional object with 
homogeneous coordinates is shown below: 



a 


b 


c 


d 


e 


f 


9 


h 




i 


k 


1 


m 


n 


0 


P 



The matrix T can be partitioned into four component matrices, each of which produces 
a specific effect on the resultant image: 





3 


3x3 


X 




1 


1 x 3 


1 X 1 




z 

> 

o 

H 
00 
00 
GO 



The 3x3 matrix produces linear transformation in the form of scaling, shearing and 
rotation. The 1 x 3 row matrix produces translation, while the 3x1 column matrix 
produces perspective transformation with multiple vanishing points. The final single 
element 1 x 1 produces overall scaling. Overall operation of the transformation matrix 
T on the position vectors of a graphic object produces a combination of shearing, 
rotation, reflection, translation, perspective, and overall scaling. 

The rotation of an object about an arbitrary axis in a three-dimensional space can be 
carried out by first translating the object such that the desired axis of rotation passes 
through the origin of the coordinate system/then rotating the object about the axis 
through the origin, and finally translating the rotated object such that the axis of rotation 
resumes its initial position. If the axis of rotation passes through the point P = [a b c 1 ], 
then the transformation matrix is representable in this form: 



[xyzh] = [xyzl] 



1 


0 


0 


0 


0 


1 


0 


0 


0 


0 


1 


0 




-b 


-c 


1 



T 



T 



1 


0 


0 


0 


0 


1 


0 


0 


0 


0 


1 


0 


a 


b 


c 


1 



T 



(2) 



translation 
to origin 



rotation 
about 
origin 



translation 
back to initial 
position 
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where R may be expressed as: 





n1 2 + d-n)2 C os<£ 


n 1 n2( 1 -cos0) + n3sin<£ 


n 1 n3( 1 -cos<£) - n2sin<£ 


0 


R * 


n 1 n2( 1 -cos<£) - n3sin<£ 


n2 2 + (1-n2) 2 cos<£ 


r»2n3( 1 -co$<f>) + n 1 sin<£ 


0 




n 1 n3( 1 -cos</>) + n2sin# 


n2n3( 1 -cos<£) - n 1 s\n<t> 


n3 2 + (1-n3) 2 cos<£ 


0 




0 


0 


0 


1 



and n1 = q1/(q1 2 + q2 2 + q3 2 ) 1 / 2 = direction cosine for x-axis of 

rotation 

n2 = q2/(q1 2 + q2 2 + q3 2 p/ 2 = direction cosine for y-axis of rotation 
n3 = q3/(q1 2 + q2 2 +q3 2 )^ 2 = direction cosine for z-axis of rotation 
n =? {n1 n2 n3) = unit vector for Q 

Q = vector defining axis of rotation = [q1 q2 q3] 

<f> = the rotation angle about Q HI 

CO 

A general rotation using equation (2) is effected by determining the [x y z] coordinates qq 
of a point A to be rotated on the object, the direction cosines of the axis of rotation 00 
[n1, n2, n3], and the angle <t> of rotation about the axis, all of which are needed to q 
define matrix [R]. Suppose, for example, that a tetrahedron ABCD, represented by < 
the coordinate matrix below is to be rotated about an axis of rotation RX which passes jjf 
through a point P = [5 - 6 3 1 1 and whose direction cosines are given by unit vector 
[n1 = 0.866, n2 = 0.5, n3 = 0.707]. The angle of rotation 0 is 90 degrees (see </) 
Figure 24). The rotation matrix [R] becomes 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



A 
B 
C 
D 



R = 



0.750 


1.140 


0.1 12 


0 


-0.274 


0.250 


1.220 


0 


1.112 


-0.513 


0.500 


0 


0 


0 


0 


1 
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(1) THIS ARROW DEPICTS THE FIRST TRANSLATION 

5 {2) THIS ARROW DEPICTS THE 90° ROTATION 
(3) THIS ARROW DEPICTS THE BACK TRANSLATION 

(/> Figure 24. Sequence of Matrix Operations 

Z 

<l 

^ The point transformation equation (2) can be expanded to include all the vertices of 

I> the tetrahedron as follows: 

O 

xa ya za hi 
xb yb zb h2 
xc yc zc h3 
xd yd zd h4 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



10 0 0 
0 1 00 
00 10 
-5 6-3 1 



T 



translation 
to origin 



0.750 1.140 0.112 0 
-0.274 0.250 1.22 0 
1.112-0.513 0.500 0 
0 0 0 1 



i 



rotation about origin 



1 


0 


0 0 


0 


1 


0 0 


0 


0 


1 0 


5- 


6 


3 1 



(3) 



translation 
back to 

initial 
position 
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The 'ACT8837 floating-point unit can perform matrix manipulation involving 
multiplications and additions such as those represented by equation (1). The matrix 
equation (3) can be solved by using the 'ACT8837 to compute, as a first step, the 
product matrix of the coordinate matrix and the first translation matrix of the right- 
hand side of equation (3) in that order. The second step involves postmultiplying the 
rotation matrix by the product matrix. The third step implements the back-translation 
by premultiplying the matrix result from the second step by the second translation 
matrix of equation (3). Details of the procedure to produce a three-dimensional rotation 
about an arbitrary axis are explained in the following steps: 

Step 1 

Translate the tetrahedron so that the axis of rotation passes through the origin, This 
process can be accomplished by multiplying the coordinate matrix by the translation 
matrix as follows: 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



1 


0 


0 


0 




(2-5) 


(-3 + 6) 


(3-3) 


1 


0 


1 


0 


0 




(1-5) 


(-2 + 6) 


(2-3) 


1 


0 


0 


1 


0 




(2-5) 


(-1+6) 


(2-3) 


1 


-5 


6 


-3 


1 




(2-5) 


(-2 + 6) 


(2-3) 


1 



translation 
to origin 



-3 


+ 3 


0 


1 


-4 


+ 4 


-1 


1 


-3 


+ 5 


-1 


1 


-3 


+ 4 


-1 


1 



vertices of translated 
tetrahedron 



AT 
BT 
CT 
DT 



oo 
oo 
oo 
h- 
o 
< 

z 
c/) 



The 'ACT8837 could compute the translated coordinates AT, BT, CT, DT as indicated 
above. However, an alternative method resulting in a more compact solution is 
presented below. 
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Step 2 



Rotate the tetrahedron about the axis of rotation which passes through the origin after 
the translation of Step 1 . To implement the rotation of the tetrahedron, postmultiply 
the rotation matrix [R] by the translated coordinate matrix from Step 1 . The resultant 
matrix represents the rotated coordinates of the tetrahedron about the origin as follows: 



-3 


3 


0 


1 


-4 


4 


-1 


1 


-3 


5 


-1 


1 


-3 


4 


-1 


1 



0.750 


■1.140 


0.112 


0 




-3.072 


-2.670 


3.324 


1 


-0.274 


0.250 


1.22 


0 




-5.208 


-3.047 


3.932 


1 


1.112 


-0.513 


0.500 


0 




-4.732 


-1.657 


5.264 


1 


0 


0 


0 


1 




-4.458 


-1.907 


4.044 


1 



1 



rotation about origin 



rotated coordinates 



Step 3 

Translate the rotated tetrahedron back to the original coordinate space. This is done 
by premultiplying the resultant matrix of Step 2 by the translation matrix. The following 
calculations produces the final coordinate matrix of the transformed object: 



> 

O 
H 
00 
00 
00 
^1 



-3.072 


-2.670 3.324 


1 




1 


0 


0 


0 




1.928 


-8.670 


6.324 


1 


-5.208 


-3.047 3.932 


1 




0 


1 


0 


0 




-0.208 


-9.047 


6.932 


1 


-4.732 


-1.657 5.264 


1 




0 


0 


1 


0 




0.268 


-7.657 


8.264 


1 


-4.458 


-1.907 4.044 


1 




5 


-6 


3 


1 




0.542 


-7.907 


7.044 


1 



i 



i 



translate back 



final rotated coordinates 
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A more compact solution to these transformation matrices is a product matrix that 
combines the two translation matrices and the rotation matrix in the order shown in 
equation (3). Equation (3) will then take the following form: 



xa ya za hi 
xb yb zb h2 
xc yc zc h3 
xd yd zd h4 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



0.750 


1.140 


0.112 


0 


-0.274 


0.250 


1.220 


0 


1.112 


-0.513 


0.500 


0 


-3.730 


-8.661 


8.260 


1 



T 



transformation matrix 



The newly transformed coordinates resulting from the postmultiplication of the 
transformation matrix by the coordinate matrix of the tetrahedron can be computed 
using equation (1) which was cited previously: 



n 

c »i =2 

k = 1 



aik * xkj for i = 1, . . . ,n j = 1, 



(1) 



CO 

oo 

CO 

O 
< 



CO 
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For example, the coordinates may be computed as follows: 



xa = c11 = a11 * xl 1 + a.1.2 * x21 + a13 * x31 + a14 * x41 

= 2 * 0.750 + (-3) * (-0.274) + 3 * 1.112 + 1 * (-3.73) 

= 1.5 + 0.822 + 3.336 - 3.73 

= 1.928 

ya = c12 = a11 * x12 + a12 * x22 + a13 * x32 + a14 * x42 

= 2 * 1.140 + (-3) * 0.250 + 3 * (-0.513) + 1x(-8.661) 

= 2.28 -0.75 - 1.539 - 8.661 

= -8.67 



za = c13 = a11 * x13 + a12 * x23 + a13 * x33 + a .14 * x43 

= 2 * 0.112 + (-3) * 1.220 -f 3 * 0.500 + 1 * 8.260 

= 0.224 - 3.66 + 1.5 + 8.260 

= 6.324 



hi = C 14 = a1 1 * x14 + a12 * x24 + a13 * x34 + a14 * x44 
= 2 * 0 + (-3) * 0 + 3 * 0 + 1 * 1 
= 0 + 0 + 0 + 1 

= .1 

A' = [1.928 - 8.67 6.324 1] 

^ The other rotated vertices are computed in a similar manner: 

5 B' = t- 5.208 -3.047 3.932 1] 

> C = [-4.732 -1.657 5.264 1) 

> D' = [-4.458 -1.907 4.044 1] 
O 

-H ■ 
00 

00 Microinstructions for Sample Matrix Manipulation 

The 'ACT8837 FPU can compute the coordinates for graphic objects over a broad 
dynamic range. Also, the homogeneous scalar factors hi , h2, h3 and h4 may be made 
unity due to the availability of large dynamic range. In the example presented below, 
some of the calculations pertaining to vertex A' are shown but the same approach 
can be applied to any number of points and any vector space. 

The calculations below show the sequence of operations for generating two 
coordinates/ xa and ya, of the vertex A.' after rotation. The same sequence could be 
continued to generate the remaining two coordinates for A' (za and hi). The other 
vertices of the tetrahedron, B', C, and D', can be calculated in a similar way. 
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A microcode sequence to generate this matrix multiplication is shown in Table 31. 
Table 32 presents a pseudocode description of the operations, clock cycles, and register 
contents for a single-precision matrix multiplication using the sum-of-products sequence 
presented in an earlier section. Registers used include the RA and RB input registers 
and the product (P) and sum (S) registers. 

Table 31. Microinstructions for Sample Matrix Multiplication 



c 


C C 








L 


00 PP 


S S 






K 


N.N I I 


E E 






M 


F F P P 


LL 


R R 


F 


0 


II EE 


0 0 


N N 


A 


D 


GGSS 


P P 


D D 


S 


E 


1-0 2-0 


7-0 


1-0 


T 



E 
L 







M 




S S 








S 


B 


E E R 




E 


E 


S / 


Y 


L L E H 




N 


N 


R 0 0 


0 T 


S S S A 


T T 


R 


R 


C L E E 


E E 


TT E L 


P P 


A 


B 


C S Y C 


S P 


1-0 T T 


1-0 



00 0100 0000 


0 


01 


0101111 


xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0110 0000 


0 


01 


0101111 


xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 0 0000 0000 


0 


01 


010 1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0000 0000 


0 


01 


010 1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0000 0000 


0 


01 


010 1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0110 0000 


0 


01 


0101111 


xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0000 0000 


0 


01 


010 1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 0 0000 0000 


0 


01 


010 1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0000 0000 


0 


01 


010 1111 


1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


10 0110 0000 


0 


01 


0101111 


xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 



Six cycles are required to complete calculation of xa, the first coordinate, and after 
four more cycles the second coordinate ya is output. Each subsequent coordinate can 
be calculated in four cycles so the 4-tuple for vertex A' requires a total of 18 cycles 
to complete. 

Calculations for vertices B', C, and D', can be executed in 48 cycles, 16 cycles for 
each vertex. Processing time improves when the transformation matrix is reduced, 
i.e., when the last column has the form shown below: 



CO 

oo 
oo 
h- 
O 
< 

r> 
z 



0 
0 
0 

1 



5-101 



Table 32. Single-Precision Matrix Multiplication (PIPES2-PIPES0 - 010) 



CLOCK 
CYCLE 


MULTIPLIER/ALU 
OPERATIONS 


PSEUDOCODE 


1 


Load a1 1, x1 1 


a11 - RA, x11 -RB 




SP Multiply 


p1 = a11 * x11 


2 


Load a12, x21 


a12 -RA, x21 -RB 




SP Multiply 


p2 = a12 * x21 




Pass P to S 


p1 -P(pD 


3 


Load a 13, x31 


a13 - RA, x31 -RB 




SP Multiply 


p3 = a13 * x31, p2 -P(p2) 




Add P to S 


P(p1) + 0- S(p1) 


4 


Load a14, x41 


a14 - RA, x41 - RB 




SP Multiply 


P 4 = a14 * x41, p3 - P(p3) 




AddPtoS 


P(p2) + S(p1) - S(p1 + p2) 


5 


Load a1 1, x12 


a11 - RA, x12 - RB 




SP Multiply 


p5 = a11 ' * x12, p4 - P(p4) 




AddPtoS 


P(p3) + S(p1 + p2) - S(p1 + p2 + p3) 


6 


Load a12, x22 


a12 - RA, x22 - RB 




SP Multiply 


p6 = a12 * x22, p5 - P(p5) 




Pass P to S 


P(p4) + S(p1 + p2 + p3) - 




Output S 


S(p1 + p2 + p3 + p4) 


7 


Load a 13, x32 


a13 -RA, x32- RB 




SP Multiply 


p7 = a13 * x32, p6 - P(p6) 




Add P to S 


P(p5) + 0- S(p5) 


8 


Load a 14, x42 


a14-RA, x42 -RB 




SP Multiply 


p8 = a14 * x42, p7 - P(p7) 




Add P to S 


P(p6) + S(p5)- S(p5 + p6) 


9 


Next operands 


A - RA, B - RB 




Next instruction 


pi = A * B, p8- P(p8) 




Add P to S 


P(p7) + S(p5 + p6) - S(p5 + p6 + p7) 


10 


Next operands 


C - RA, D - RB 




Next instruction 


pj = C * D, pi - P(pi) 




Output S§ 


P(p8) + S(p5 + p6 + p7) — 






S(p5 + p6 + p7 + p8) 



The h-scalars hi , h2, h3, and h4 are equal to 1 . The number of clock cycles to generate 
each 4-tuple can then be decreased from 1 6 to 13 cycles. Total number of clock cycles 
to calculate all four vertices is reduced from 66 to 54 clocks. Figure 25 summarizes 
the overall matrix transformation. 
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P {5, -6, 3) 



Figure 25. Resultant Matrix Transformation 



This microprogram can also be written to calculate sums of products with all pipeline 
registers enabled so that the FPU can operate in its fastest mode. Because of timing 
relationships, the C register is used in some steps to hold the intermediate sum of 
products. Latency due to pipelining and chained data manipulation is 1 1 cycles for 
calculation of the first coordinate, and four cycles each for the other three coordinates. 

After calculation of the first vertex, 16 cycles are required to calculate the four 
coordinates of each subsequent vertex. Table 33 presents the sequence of calculations 
for the first two coordinates, xa and ya. 



CO 

oo 
oo 
h- 
o 
< 

z 
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Table 33. Fully Pipelined Sum of Products (PIPES2-PIPES0 = 000) 
(Bus or Register Contents Following Each Rising Clock Edge) 



CLOCK 
CYCLE 


1 

BUS 


DA 
BUS 


DB 
BUS 


I 

REG 


RA 
REG 


RB 
REG 


MUL 
PIPE 


ALU 
PIPE 


P 
REG 


s 

REG 


c 

REG 


Y 
BUS 


o 


Mul 


xl 1 


a1 1 




















1 


Mul 


x21 


a12 


Mul 


x1 1 


a1 1 














2 


Chn 


x31 


a1 3 


Mul 


x21 


a1 2 


P 1 












3 


Mul 


x41 


a14 


Chn 


x31 


a1 3 


P z 




r»1 
P 1 








4 


Chn 


x1 2 


a1 1 


Mul 


x41 


a14 


po 


s1 


P^ 








5 


Chn 


x22 


a1 2 


Chn 


x12 


a1 1 


□4 


t 


d3 


s1 


□2 




6 


Chn 


x32 


a13 


Chn 


x22 


a12 


P 5 


s2 


P4 


t 


P2 




7 


Chn 


x42 


a14 


Chn 


x32 


a13 


P6 


s3 


P5 


s2 


P2 




8 


Chn 


x13 


a11 


Chn 


x42 


a14 


P7 


s4 


P6 


s3 


s2 




9 


Chn 


x23 


a12 


Chn 


x13 


a11 


P8 


xa 


P7 


s4 


P6 




10 


Chn 


x33 


a13 


Chn 


x23 


a12 


P9 


s5 


P8 


xa 


P6 


xa 


11 


Chn 


x43 


a14 


Chn 


x33 


a13 


p10 


s6 


P9 


s5 


P6 




12 


Chn 


x14 


a11 


Chn 


x43 


a14 


p11 


s7 


p10 


s6 


s5 




13 


Chn 


x24 


a12 


Chn 


x14 


a11 


P12 


ya 


P11 


s7 


p10 




14 


Chn 


x34 


a13 


Chn 


x24 


a12 


p13 


s8 


P12 


ya 


p10 


ya 


15 


Chn 


x44 


a14 


Chn 


x34 


a13 


p14 


s9 


p13 


s8 


p10 





1" Contents of this register are not valid during this cycle. 

Products in Table 33 are numbered according to the clock cycle in which the operands 
and instruction were loaded into the RA, RB, and I register, and execution of the 
instruction began. Sums indicated in Table 33 are listed below: 



0) 

z 

> 
o 

H 
00 
00 
00 



s1 = p1 + 0 
s2 = p1 + p3 
s3 = p2 + p4 
s4 = p5 + 0 



s5 = p5 + p7 
s6 = p6 + p8 
s7 = p9 + 0 
s8 = p9 + p11 



s9 = p10 + p12 

xa = p1 + p2 + p3 + p4 

ya = p4 + p5 + p6 + p7 
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SAMPLE MICROPROGRAMS FOR BINARY DIVISION AND 
SQUARE ROOT 

The SN74ACT8837 Floating Point Unit supports binary division and square root 
calculations using the Newton-Raphson algorithm. The 'ACT8837 performs these 
calculations by executing sequences of floating-point operations according to the 
control settings contained in specific microprogrammed routines. This implementation 
of the Newton-Raphson algorithm requires that a seed ROM provide values for the 
first approximations of the reciprocals of the divisors. 

This application note presents several microprograms for floating-point division and 
square root using the Newton-Raphson algorithm. Each sample program is analyzed 
briefly to show details of the floating-point procedures being performed. 

Binary Division Using the Newton-Raphson Algorithm 

Binary division can be performed as an iterative procedure using the Newton-Raphson 
algorithm. For a dividend A, divisor B, and quotient Q, this procedure calculates a value 
for 1/B which is then used to evaluate the expression Q = A * 1/B. The calculation 
can be performed with either single- or double-precision operands, and examples of 
each precision are shown. 

The basic algorithm calculates the value of a quotient Q by approximating the reciprocal 
of the divisor B to adequate precision and then multiplying the dividend A by the 
approximation of the reciprocal: 

Q = A/B = A * Xn, where Xn = the value of X after the nth iteration 

n = the number of iterations to achieve the 
desired precision 

Intermediate values of X are calculated using the following expression: 

Xi + 1 = Xi * (2 - B * Xi), where XO = approximates 1/B for 

the range 0 <X0 < 2/B 

To illustrate a program using the Newton-Raphson algorithm, the sequence of 
calculations is presented in detail. For double-precision operations, three iterations are 
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needed to achieve adequate precision in the value of 1/B. A value for the seed XO 
(approximately equal to 1/B) is assumed to be given, and the following operations are 
performed to evaluate Q from double-precision inputs: 

X1 = X0(2 - B * XO) 

X2 = X1(2 - B * X1) = X0(2 - B * XO) * (2 - B * X0(2 - B * XO)) 
X3 = X2(2 - B * X2) 

X3 = X0(2-B * XO) * (2-B * X0(2-B * XO)) * (2-B * XO * (2-B 

* XO) * (2-B * XO * (2-B * XO))) 

Q = A * 1/B = A * X3 

A/B = A * XO(2-B * XO) * (2-B * X0(2-B * XO)) * (2-B * XO 

* (2-B * XO) * (2-B * XO * (2-B * XO))) 



X1 



X1 



X1 



X1 



X2 



X2 



X3 



(A 



Table 36 presents decimal and hexadecimal values for A, B, and XO, which are used 
5 in the sample calculation. The computed value of the quotient Q is also included, 
£ showing the representations of the results of this sample division. 



> 

O 

00 
00 
GO 



Table 34. Sample Data Values and Representations 



TERM 


DECIMAL REPRESENTATION 


IEEE HEXADECIMAL 
REPRESENTATION 


VALUE 


MANTISSA • 2 EXPONENT 


A 


22 




1.375 * 2 4 


40360000 00000000 


B 


7 




1.75 * 2 2 


401C0000 00000000 


XO 


1/7 




1.140625 * 2 (-3) 


3FC24000 00000000 


Q 


22/7 




1.5714285714285713 * 2 1 


40092492 49249249 



In Table 35, the sequence and timing of this procedure is shown exactly as performed 
by the 'ACT8837. This example shows the steps in a double-precision division requiring 
three iterations to achieve the desired accuracy. In this table each operation is 
sequenced according to the clock cycles during which the instruction inputs for that 
operation are presented at the pins of the 'ACT8837. Operations are accompanied 
by a pseudocode summary of the operations performed by the ' ACT8837 and the clock 
cycle when an operand is available or a result is valid. 

Each line of pseudocode indicates the operands being used, the operations being 
performed, the registers involved, and the clock cycles when the results appear. Each 
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register is represented by its usual abbreviation (RA, RB, P, S, or C) followed by the 
number of the clock cycle when an operand will be valid or available at the register. 
For example, "P. 4" refers to the contents of the Product Register after the fourth 
clock cycle. 



Table 35. Binary Division Using the IMewton-Raphson Algorithm 



CLOCK 
CYCLES 


OPERATIONS 


PSEUDOCODE 


1, 2 




B * XO 




B - RA.2, XO - RB.2 
RA.2 * RB.2 - P.4 


3, 4 




2 - B * XO 




2 - P.4'-* S.6 


5, 6 


X1 


= X0{2 - B * 


XO) 


RB.2 * S.6 - P.8 


7, 8 




B * X1 




RA.2 * P.8 -> P.10 


9, 10 




2 ~ B * X1 




P.8 -> C.9, 2 - P.10 - S. 12 


11, 12 


X2 


= X1 (2 - B * 


XT) 


C.9 * S.12 P.14 


13, 14 




B * X2 




RA.2* P.14 P. 16 


15, 16 




2 - B * X2 




P.14-* C.15, 2 - P. 16- S.18 


17, 18 


X3 


- X2(2 - B * 


X2) 


A RA.18, C.1 5 * S.18-P.20 


19, 20 




A * X3 




RA.18 * P.20 P.22 


21, 22 




Output MSH 




P.22.MSH -* Y 



The sequence of operations can be microcoded for execution exactly as listed in the R| 
table above. Sample microprograms {with data and parity fields provided) are given gJl 
below. To make the programs easier to follow, comment lines have been included to 
indicate clock timing, calculation performed by the instructions being loaded, and £j 
operations being represented, in the same pseudocode as in the preceding table. The 00 
fields in the microinstruction sequences presented below are arranged in the following £2 
order: 

S 

I E 

N C C L 

S L G M 
L T K 0 P S S B S R _ T 
I C RMNIE. F E E S / _ YE EH ED D 
N C L U 0 F P L R A N N R _ 0 0 0 T L S A S A D B D 

ELK C DIEONSRRCLEEEESELT3 A3 B 

# K C T E GS PDTABCSYC3PTTTP 1 01-----0 

d h.h 'hhh h h h hh h h h h h h h h h h h h h h hhhhhhhh hhhhhhhh 



P P 
A B 



O 
< 



h h 



All fields in the sample microcode sequences (except for line numbers) are represented 
as hexadecimal numbers. Line numbers are the only decimal numbers in the samples. 
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Single-Precision Newton-Raphson Binary Division 

Use of the Newton-Raphson algorithm is similar for both single- and double-precision 
operands. However, for implementations which handle both single- and double- 
precision division, it may be preferable to use a double-precision seed ROM, converting 
the double-precision seeds to single precision when necessary. 

The following sample program involves conversion of a double-precision seed XO for 
use in single-precision division. Since B is given as a single-precision number, it must 
be converted to double precision in order to address a double-precision seed ROM. 
Then the seed XO, which is double precision, must be converted to single precision 
for the actual calculation. 

Two iterations are used in the single-precision example. Thus, the formula 
Q = A * 1/B may be rewritten with n = 2: 

Q = A * 1/B = A * X2 




where X2 = X1 * (2 - B * X1) and X1 = XO * (2 - B * XO) 

A * 1/B = A * XO * (2 - B * XO) * [2 - B * XO * (2 - B * XO)] 

Table 36 presents a single-precision division using a double-precision seed ROM. This 
example divides 22/7. 



CO 

z 

> 
o 

H 
00 
00 
CO 
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Table 36. Single-Precision Newton-Raphson Binary Division 



Lines 1-2 Calculation: B s.p. -** d.p. 

Operations: B RA.1 , (s.p. to d.p.)(RA.1 ) -* S.2 

01 0 0 026 1 1 3 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40E00000 00000000 0 0 

02 1 0 026 1 1 3 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40E00000 00000000 0 0 



Lines 3-4 Calculation: Load XO 

Operations: XO RA. 4 

03 001 26 102 FF 00101 10000 3 1 1 3 3FC24000 00000000 0 0 

04 1 0 1 26 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FC24000 00000000 0 0 



Lines 5-6 



Calculation: XO d.p. -* s.p. 
Operations: (d.p. to s.p.){RA.4) — S.6 



05 0 0 126 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FC24000 00000000 0 0 ! 

06 1 0 126 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FC24000 00000000 0 0 j 

Lines 7-8 Calculation: Load B, B * XO 

Operations: S.6 C.7, B -* RA.8 RA.8 * C.7 -* P.10 00 

f- 

07 0 1 040 1 0 2 DF 0 0 1 0 0 1 0 0 0 0 3 1 1 3 40E00000 00000000 0 0 ^ 

08 10 040 102 DF 001001000031 13 40E00000 00000000 0 0 <fr 



Lines 9-10 Calculation: 2 - (B * XO) 

Operations: 2 - P.10 S. 12 

09 00 202 002 FB 00001 10000 31 1 3 00000000 00000000 0 0 

10 1 0 202 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 11-12 Calculation: X1 = X0(2-B * XO) 
Operation: C.7 * S.1 2 -* P.14 

11 0 0 040 002 9F 0 0001 1000031 1 3 00000000 00000000 0 0 

12 1 0 040 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 
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Table 36. 



Single-Precision Newton-Raphson Binary Division (Concluded) 



Lines 13-14 Calculation: B * X1 

Operation: RA.8 * P.14 P.16 

13 0 0 040 002 EF 00001 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

14 1 0 040 002 EF 00001 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines ^5-16 Calculation: 2 - (B * X1) 

Operations: P.14 — C.1 5, 2 - P.16 S.18 

15 0 0 202 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

16 1 0 202 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 17-18 Calculation: X2 = X1(2 - B * X1) 

Operations: A — RA.1 8, C.1 5 * S.1 8 P.20 

17 0 0 040 002 9F 00101 1000031 13 41B00000 00000000 0 0 

18 1 0 040 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 41B00000 00000000 0 0 



Lines 19-20 Calculation: A * X2 

Operations: RA.1 8 * P.20 — P.22 



> 19 0 0 040 0 02 EF 00001 1000031 1 3 00000000 00000000 0 0 
<3 20 1 0 040 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

00 
CO 

to 



Lines 21-22 Operation: P.22 -» Y 



21 0 0 020 002 EF 00001 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

22 1 0 020 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 
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Double-Precision Newton-Raphson Binary Division 

If the value of B is given as a double-precision number and XO is looked up in a double- 
precision seed ROM, no conversions are required prior to performing a double-precision 
division using the Newton-Raphson algorithm. Three iterations are used in the double- 
precision example (n = 3). The following formula represents the sequence of 
calculations to be performed: 

A/B = A * XO * (2 - B * XO) * [2 - B *'X0 * (2 - B * XO)] 

* (2 - B * XO .(2 - B * XO) * [2 - B * XO (2 - B * XO)]) 

Table 37 shows a double-precision division using a double-precision seed ROM. The 
example divides 22/7. 



Table 37. Double-Precision Newton-Raphson Binary Division 



Lines 1-4 Calculation: B * XO 

Operations: B — RA.4, XO RB.4, RA.4 * RB.4 -* P.8 

01 0 0 1C0 0 0 2 FF 0 0 0 0 1 1000031 1 3 3FC24000 00000000 0 0 

02 10 1C0 002 FF 00001 1000031 13 3FC24000 00000000 0 0 

1 3 401C0000 00000000 0 0 
1 3 401C0000 00000000 0 0 



03 0 0 1C0 0 0 2 FF 0 0 1 1 1 

04 1 0 1C0 0 0 2 FF 0 0 1 1 1 



10 0 0 0 3 1 
10 0 0 0 3 1 



Lines 5-8 



Calculation: 
Operation: 



(B * XO) 
P.8 S.12 



05 0 0 382 0 0 2 FB 0 0 0 0 

06 1 0 382 0 0 2 FB 0 0 0 0 

07 0 0 382 0 0 2 FB 0 0 0 0 

08 1 0 382 0 0 2 FB 0 0 0 0 



0 0 0 0 3 

0 0 0 0 3 

0 0 0 0 3 

0 0 0 0 3 



1 1 3 00000000 00000000 0 0 
1 1 3 00000000 00000000 0 0 
1 13 00000000 00000000 0 0 
1 1 3 00000000 00000000 0 0 



Lines 9-12 



Calculation: 
Operation: 



X1 = X0(2-B * XO) 
RB.4 * S.12 - P.16 



09 0 0 1C0 0 0 2 BF 0 0 0 0 1 

10 1 0 1C0 0 0 2 BF 0 0 0 0 1 

11 0 0 1C0 0 0 2 BF 0 0 0 0 1 

12 1 0 1C0 0 0 2 BF 0 0 0 0 1 



1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



CO 
CO 
00 

o 
< 

z 
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Table 37. Double-Precision Newton-Raphson Binary Division (Continued) 



Lines 13-16 Calculation: B * X1 

Operations: RA.4 * P. 16 -» P. 20 

13 0 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

14 1 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

15 0 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

16 1 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 



Lines 17-20 Calculation: 2 - (B * X1) 

Operations: P. 16 C.18, 2 - P.20 -* S.24 

17 0 0 382 002 FB 00001 1 00 0031 1 3 00000000 00000000 

18 1 1 382 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

19 0 0 382 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

20 1 0 382 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 



Lines 21-24 Calculation: X2 = XK2-B * X1) 
Operations: C.18 * S.24 P.28 

21 0 0 1C0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

22 1 0 1C0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

23 0 0 1C0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

24 1 0 1C0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 



Lines 25-28 Calculation: B *■ X2 

Operations: RA.4 * P. 28 — P. 32 

25 0 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

26 1 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

27 0 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

28 1 0 1C0 0 0 2 EF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 



Lines 29-32 Calculation: 2 - (B * X2) 

Operations: P.28 -* C.30, 2 - P.32 S.36 

29 0 0 382 0 0 2 FB 00001 1000031 13 00000000 00000000 

30 1 1 382 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

31 0 0 382 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

32 1 0 382 0 0 2 FB 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 
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Table 37. 



Double-Precision Newton-Raphson Binary Division (Concluded) 



Lines 33-36 Calculation: X3 = X2(2-B * X2) 

Operations: A -* RA.36, C.30 * S.36 -* P. 40 



33 0 0 1C0 0 3 2 9F 0 0 1 

34 1 0 1C0 0 3 2 9F 0 0 1 

35 0 0 1C0 0 3 2 9F 0 0 1 

36 1 0 1C0 0 3 2 9F 0 0 1 



Lines 37-40 



Calculation: 
Operations: 



1 1 1 0 0 0 0 3 1 1 3 40360000 00000000 0 0 

1 1 1 0 0 0 0 3 1 1 3 40360000 00000000 0 0 

1 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

1 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



A * X3 

RA.36 * P.40 P.44 



37 0 0 


1C0 0 


0 2 EF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 


38 1 0 


1C0 0 


0 2 EF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 


39 0 0 


1C0 0 


0 2 EF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 


40 1 0 


1C0 0 


0 2 EF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 



Lines 41-44 Operation: P.44.MSH Y 



41 


0 0 


120 0 


0 2 FF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 


42 


1 0 


120 0 


0 2 FF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 


43 0 0 


120 0 


0 2 FF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 


44 


1 0 


120 0 


0 2 FF 0 0 0 


0 


1 


1 


0 0 


0 


0 


3 


1 


1 


3 


00000000 


00000000 


0 


0 



Line 45 



Operation: P.44.LSH — Y 



CO 

oo 
oo 

H 
O 
< 

Z 

C/) 



45 0 0 120 0.0 2 FF 0 0 0 0 1 0 0 0 0 0 3 1 1 3 00000000 00000000 0 0 
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Binary Square Root Using the Newton-Raphson Algorithm 

Square roots may be calculated iteratively using the Newton-Raphson algorithm. The 
procedure is similar to Newton-Raphson division and involves evaluating the following 
expression: 

A = B * Xn 

where Xn = the value of X after the nth iteration given 

Xi + 1 = 0.5 * Xi * [3 - B ■ * (Xi ZZ 2)] 

XO = a guess at 1/sqrt(B) where 0 < XO < sqrt(3/B) 

and n = number of iterations to achieve the desired precision 
Single-Precision Square Root Using a Double-Precision Seed ROM 

When the value of B is given in single precision, it must be converted to a double- 
precision number before it can be used to address a double-precision seed ROM. Since 
the seed XO is stored as a double-precision number, it must first be converted to single 
precision before it is used in the calculation. 

Two iterations (n = 2) are used in a single-precision calculation so the following 
expression for sqrt(B) is to be evaluated: 

A = B * X2 

where X2 = 0.5 * X1 * [3 - B * (X1 2)] 

and X1 = 0.5 * XO * [3 - B * (XO 2)] 

A = B * 0.5 * 0.5 * XO * [3 - B * (XO 2)] 

* [3 - B * (0.5 * XO' * [3 - B * (XO 2)]) 2] 



(/> 
Z 

> 

o 



00 
00 

CO 
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Table 38. Single-Precision Binary Square Root 



Lines 1-2 Calculation: B s.p. -* d.p. 

Operations: B — RA.1 , (s.p. to d.p.)(RA.1 ) — S.2 

01 0 0 026 1 1 3 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40000000 00000000 0 0 

02 1 0 026 1 1 3 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40000000 00000000 0 0 



Lines 3-4 Calculation: Load XO 

Operation: XO RA.4 

03 0 0 1 26 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE6A000 00000000 0 0 

04 1 0 1 26 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE6A000 00000000 0 0 



Lines 5-6 Calculation: XO d.p. -* s.p. 

Operations: (d.p. to s.p.)(RA.4) S.6 

05 0 0 1 26 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE6A000 00000000 0 0 

06 1 0 126 1 0 2 FF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE6A000 00000000 0 0 



Lines 7-8 Calculation: Load B, B * XO rv 

Operations: S.6 - C.7, B - RB.8, RB.8 * C.7 - P. 10 g 

00 

07 01 040 102 7F 000101000031 13 40000000 00000000 0 0 h 

08 1 0 040 1 0 2 7F 0 0 0 1 0 1 0 0 0 0 3 1 1 3 40000000 00000000 0 0 ^ 

Lines 9-10 Calculation: B ■ * XO 2 ^ 
Operations: P. 10 ■* C.7 - P.1 2, 3 - RA.10 - S.1 2 

09 0 0 260 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40400000 00000000 0 0 

10 1 0 260 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40400000 00000000 0 0 



Lines 11-12 Calculation: 3 - (B * XO 2) 

Operation: S.12 - P.12 -* S.14 

1 1 0 0 003 0 0 2 FA 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

12 1 0 003 0 0 2 FA 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 
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Table 38. Single-Precision Binary Square Root (Continued) 



C/5 
Z 

> 

o 



Lines 13-14 Calculation: XO * (3 - (B * XO 2)) 

Operations: C. 7 * S. 14 — P.1 6, 1/2 - RA.14 - S. 16 

13 0 0 260 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3F000000 00000000 0 0 

14 1 0 260 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3F000000 00000000 0 0 



Lines 15-16 Calculation: 1/2. * XO *.(3-(B * XO- 2)) — X1 
Operations: S.16 * P. 16 -* P.18, 0 RA.16, 
RA.16 + RB.8 S.18 

1 5 0 0 240 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 
16 1 0 240 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 17-18 Calculation: B * X1 

Operations: S.1 8 * P.1 8 — P.20 

17 0 0 040 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

18 1 0 040 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 19-20 Calculation: B * X1 2 

Operations: P.1 8 - C.1 9, P.20 ■* C.1 9 P.22, 
3-*RA.20-*S22 



00 19 0 1 260 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40400000 00000000 0 0 
Ca) 20 1 0 260 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40400000 00000000 0 0 

Lines 21-22 Calculation: 3 - (B ■* X1 2) 

Operations: S.22 - P.22 -* S.24 

21 0 0 003 0 0 2 FA 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

22 1 0 003 002- FA 00001 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 23-24 Calculation: X1 * (3 - (B ^ X1 2)) 

Operations: C.1 9 * S.24 P.26, 1/2 RA.24 S.26 

23 0 0 260 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3F000000 00000000 0 0 

24 1 0 260 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3F000000 00000000 0 0 
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Table 38. Single-Precision Binary Square Root (Concluded) 



Lines 25-26 Calculation: 1/2 * X1 * (3 - (B * XT 2)) — 
Operations: S.26 * P.26 — P.28, 0 -* RA.26, 
RA.26 + RB.8 S.28 



X2 



25 0 0 240 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

26 1 0 240 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 27-28 



Calculation: 
Operations: 



B * X2 A 

S.28 * P.28 P.30 



27 0 0 040 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

28 1 0 040 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 29-30 



Calculation: 
Operation: 



NOP 

Y — Output 



29 0 1 00A 0 0 2 FF 0 0 0 0 1 1 0 0 0 0 3 1 1 

30 1 0 00A 0 0 2 FF 0 0 0 0 1 1 0 0 0 0 3 1 1 



3 00000000 00000000 0 0 
3 00000000 00000000 0 0 



Double-Precision Square Root 

The value of B is given as a double-precision number so XO can be looked up from 
a double-precision seed ROM without conversion from one precision to the other. Three 
iterations (n = 3) are required in the double-precision calculation, and the following 
formula for sqrt(B) is to be evaluated: 



A = B * 0.5 * 0.5 * 0.5 * XO * 

* [3 - B * (0.5 * XO * [3 - B * 

* [3 - B * (0.5 * 0.5 * XO * [3 

* [3. - B * (0.5 * XO * [3 - B * 



[3 - B * (XO 
(XO 2)]) 2] 
- B * (XO 2)] 
(XO 2)]) 2]) 2] 



2)] 



CO 
CO 
00 
H 

o 
< 

z 
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Table 39. Double-Precision Binary Square Root 



CO 

2! 

> 

O 
H 
00 
00 
CO 



Lines 1-4 Calculations: Load B, Load XO, B * XO 

Operations: B -* RB.4, XO -* RA.4, RA.4 * RB.4 ~+ P.8 
RA.4-S.8.-C.iO- 



01 0 0 3E0 0 0 2 FF 00001 1000031 

02 1 0 3E0 002 FF 00001 1 0 0 0 0 3 1 

03 0 0 3E0 0 0 2 FF 0 0 1 1 1 1 0 0 0 0 3 1 

04 1 0 3E0 0 0 2 FF 0 0 1 1 1 1 0 0 0 0 3 1 



Lines 5-8 



Calculations: 
Operations: 



B * XO 2 

P.8 * S.8 - P.12, 



05 00 3E0 002 AF 00001 1000031 

06 1 0 3E0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 

07 0 0 3E0 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 

08 1 0 3E0 0 0 2 AFOO 1 0 1 1 0 0 0 0 3 1 



1 3 40000000 00000000 0 0 
1 3 40000000 00000000 0 0 
1 3 3FE6AO00 00000000 0 0 
1 3 3FE6A000 00000000 0 0 



3 RA.8 S.12 

1 3 00000000 00000000 0 0 
1 3 00000000 00000000 0 0 
1 3 40080000 00000000 0 0 
1 3 40080000 00000000 0 0 



Lines 9-1 2 



Calculations: 
Operations: 



3 - (B 
S.12 - 



* XO 
P.12 • 



2) 



09 
10 
11 
12 



183 
183 
183 
183 



FA 
FA 
FA 
FA 



►S.16 

3 1 1 3 00000000 00000000 
3 1 1 3 00000000 00000000 
3 1 1 3 00000000 00000000 
3 1 1 3 00000000 00000000 



Lines 13-16 



Calculations: 
Operations: 



XO * (3 - (B 
C.10 * S.16 ~ 



* XO 2» 
P.20, 1/2 



RA.16 S.20 



13 0 0 3E0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 

14 1 0 3E0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 

15 0 0 3E0 002 9F 0010 1 100003 

16 1 0 3E0 0 02 9F 00101 100003 



1 1 3 00000000 00000000 0 0 
1 1 3 00000000 00000000 0 0 
1 1 3 3FE00000 00000000 0 0 
1 1 3 3FE00000 00000000 0 0 
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Table 39. Double-Precision Binary Square Root (Continued) 



Lines 17-20 Calculations: 1/2 '* XO * (3-(B * XO 2)) X1 

Operations: S.20 * P.20 - P.24 - C.25, 0 - RA.20, 
RA.20 + RB.4 — S.-24 

17 0 0 3C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

18 10 3C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

19 0 0 3C0 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

20 1 0 3C0 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 21-24 Calculations: B * X1 

Operations: S.24 * P.24 -* P.28 

21 0 0 1C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

22 1 0 1C00 0 2 AF 0 0 0 0 1 1000031 13 00000000 00000000 0 0 

23 0 0 1C000 2 AF0 00 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

24 101 CO 0 02 AF 00001 1000031 13 00000000 00000000 0 0 



Lines 25-28 Calculations: B * X1 2 

Operations: P.28 * C.25 -* P.32, 3 RA.28 -> S.32 

CO 

25 0 1 3E0 0 0- 2 6F 00001 1000031 1' 3 00000000 00000000 0 0 00 

26 10 3E0 002 6F 00001 1000031 13 00000000 00000000 0 0 

27 0 0 3E0 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40080000 00000000 0 0 O 

28 1 0 3E0 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40080000 00000000 0 0 < 

2 

Lines 29-32 Calculations: 3 - (B * X1 2) w 
Operations: S.32 - P.32 S.36 

29 0 0 183 0 0 2 FA 0 0 0 0 0 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

30 1 0 183 002 FA 00000 1000031 1 3 00000000 00000000 0 0 

31 0 0 183 0 0 2 FA 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

32 1 0. 183 0 -0 2 FA 0. 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 
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Table 39. Double-Precision Binary Square Root (Continued) 



Lines 33-36 Calculations: X1 * (3 - (B '■* XI 2)) 

Operations: C.25 * S.36 P.40, 1/2 RA.36 S.40 

33 0 0 3E0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

34 1 0 3E0 0 0 2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

35 0 0 3E0 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE00000 00000000 

36 1 0 3E0 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE00000 00000000 



Lines 37-40 Calculations: 1/2 * X1 * (3 - (B * X1 2))--X2 

Operations: S.40 * P.40 - P.44 - C.45, 0 - RA.40, 
RA.40 + RB.4 S.44 

37 0 0 3C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

38 1 0 3C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

39 0 0 3C0 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

40 1 0 3C0 0 0 2 AFOO 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 



Lines 41-44 Calculations: B * X2 

Operations: S.44 * P.44 -> P.48 

41 0 0 1C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

42 1 0 1C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

43 0 0 1 CO 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

44 1 0 1 CO 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 



Lines 45-48 Calculations: B * X2 2- 

Operations: P.48 * C.45 P.52/ 3 - RA.48 -*> S.52 

45 0 1 3E0 0 0 2 6F 00001 1000031 13 00000000 00000000 

46 1 0 3E0 0 0 2 6F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 

47 0 0 3E0 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40080000 00000000 

48 1 0 3E0 0 0 2 6F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 40080000 00000000 
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Table 39. Double-Precision Binary Square Root (Continued) 



Lines 49-52 Calculations: 3 - (B * X2 2) 

Operations: S.52 - P.52 — S.56 

49 0 0 183 0 0 2 FA 0 0 0 0 0 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

50 1 0 183 0 0 2 FA 0 0 0 0 0 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

51 0 0 183 0 0 2 FA 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

52 1 0 183 0 0 2 FA 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



Lines 53-56 Calculations: X2 * (3 - (B * X2 2)) 

Operations: C.45 * S.56 -* P.60, 1/2 - RA.56 S.60 

53 0 0 3E0 0 0 2 9F 0 0 0 0 1 1000031 1 3 00000000 00000000 0 0 

54 1 0 3E0 0.0-2 9F 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

55 0 0 3E0 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE00000 00000000 0 0 

56 1 0 3E0 0 0 2 9F 0 0 1 0 1 1 0 0 0 0 3 1 1 3 3FE00000 00000000 0 0 



Lines 57-60 Calculations: 1/2 * X2 * (3 - (B * X2 )) X3 
Operations: S.60 * P.60 -* P.64, 0 RA.60, 

RA.60 + RB.4 - S.64 ^ 

00 

57 0 0 3C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 O g 

58 1 0 3C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 h- 

59 0 0 3C0 0 0 2 AF 0 0 1 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 O 

60 1 0 3C0 002 AF 00101 1 000031 1 3 00000000 00000000 0 0 



Lines 61-64 Calculations: B * X3 A 

Operations: S.64 * P.64 P.68 ~> Y.MSH 

61 0 0 1C0 0 0 2 AF 0 0.0 0 1 1 0 0 00 3 1 1 3 00000000 00000000 0 0 

62 1 0 1C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

63 001 CO 002 AF 00001 1000031 1 3 00000000 00000000 0 0 

64 1 0 1C0 0 0 2 AF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



< 

z 
c/> 
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Table 39. Double-Precision Binary Square Root (Concluded) 



Lines 65-68 Calculation: NOP 

Operation: Y.MSH Output 

65 0 1 18A 0 0 2 FF 0 00011 0 0 0 031 13 00000000 00000000 0 0 

66 1 0 18A 0 0 2 FF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

67 0 0 18A 0 0 2 FF 0 0 0 0 1 1 0 0 0 0 3 1 1 3 00000000 00000000 0 0 

68 1 0 18A 002 FF 0 0 00 1100 0031 13 00000000 00000000 0 0 



Line 69 Calculation: NOP 

Operation: Y.LSH Output 

69 0 0 18A 0 0 2 FF 0 0 0 0 1 0 0 0 0 0 3 1 1 3 00000000 00000000 0 0 



5 
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GLOSSARY 



Biased exponent — The true exponent of a floating point number plus a constant called 
the exponent field's excess. In IEEE data format, the excess or bias is 1 27 for single- 
precision numbers and 1023 for double-precision numbers. 

Denormalized number (denorm) — A number with an exponent equal to zero and a 
nonzero fraction field, with the implicit leading (leftmost) bit of the fraction field being 0. 

NaN (not a number) - Data that has no mathematical value. The / ACT8837/'ACT8847 
produces a NaN whenever an invalid operation such as 0 •* oo is executed. The output 
format for an NaN is an exponent field of all ones, a fraction field of all ones, and a 
zero sign bit. Any number with an exponent of all ones and a nonzero fraction is treated 
as a NaN on input. 

Normalized number — A number in which the exponent field is between 1 and 254 
(single precision) or 1 and 2046 (double precision). The implicit leading bit is 1. 

Wrapped number — A number created by normalizing a denormalized number's fraction 
field and subtracting from the exponent the number of shift positions required to do 
so. The exponent is encoded as a two's complement negative number. 
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Implementing a Double-Precision Seed ROM 

The seed ROM assumed in the previous microcode examples is a double-precision seed 
ROM containing both division and square root seeds. Six chips are necessary to build 
this seed ROM: five 4x4096 registered PROMs and one latch (ordinarily implemented 
in a PAL). Figure 26 shows a sample implementation for a double-precision seed ROM. 

Three of the PROMs are for generating the exponent part of the seed. All 1 1 exponent 
lines are necessary to accurately determine the exponent of the seed. There are 
12 address lines in a 4x1024 PROM, so the last address line can be used for a 
microcode bit that tells whether a divide or square root seed is being read . Since there 
are only 1 1 bits in the exponent and three PROMs are used, there are 1 2 output bits 
but one bit is not used. The equations giving the contents of the PROMs is given in 
a later section. 

The other two PROMs generate the mantissa part of the seed. One address line of 
the PROMs is used for the microcode bit telling whether a divide or square root seed 
is to be used. For a square root seed, the least significant bit of the exponent is needed 
in generating the mantissa seed. Therefore, another address line of the PROMs is used 
by the least significant exponent bit. This leaves 10 address lines to be used to look 
up the mantissa seed. Since there are eight output bits from the two PROMs, an eight- 
bit seed is generated. 

KB The sign bit of B needs to be preserved for use when the seed is read. In the case 
QJof binary division, this requirement is obvious. In the square root calculation, the sign 
bit of B should always be zero. This condition should be tested by the microprogram. 

2j Since every real square root has two answers, normally the positive answer is assumed. 

^ However, since the sign of B is meaningless to Newton-Raphson unless it is positive, 

> the example microprograms assume that a negative B simply means that the negative 

Q of the square root of B is the desired answer instead of the positive root. This is 

00 accomplished by using the absolute value of B in all computations except for looking 

9° up the seed. If the seed is negative, then the answer generated will be the negative root. 
CO 

^ PROM Contents 

Because one address line of the PROMs selects divide or square root, the PROMs can 
be considered to be divided functionally into two halves: the divide half and the square 
root half. Each functional half is discussed separately in the sections below. 

Divide PROMs 

The exponent part of the seed is defined in the following manner. Assuming that 
B = m * (2 e ) and XO = m' * (2 e '), e' is computed as e' = -e. Using the definition 
of an IEEE number, the value of m ean be represented as a number within the following 
interval: 1 < m < 2. 
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Figure 26. IEEE Double-Precision Seed ROM for 
IMewton-Raphson Division and Square Root 
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This range of values of m can be subdivided into two cases: 
m = 1, or 1 < m < 2 




Since m' is computed as m' = 1 / m, the range of m' will be 

m' = 1, or 1/2 < m' < 1 

To be represented as a normalized IEEE number, m" would be 

m" = m' * {2h = 2/m (1) 

This would make the range of m" 

m" = 2, or 1 < m" < 2 

This is still not quite in the range of a valid IEEE number; however, m" = 2 only when 
m = 1. Therefore, m" can be forced to be just less than 2 in this case. 

Since XO = m' * (2 e ), to use m" in the PROMs, we must have an e" in the exponent 
such that XO = m" * (2 e "). This is true for e" = e' -1. Since, XO = m" * (2e"), 
the following substitution can be made: 

XO = (m' * (2D) * {2te'-1>) 

= m' * (21) * (2©') * (2( — 1)) 
= m' * (2e') * (2d-D) 
= m' * (2®') * (20) 
= m' * (2©') 

^ Therefore, if e" is used in the exponent PROMs and m" is used in the mantissa PROMs, 
a normalized IEEE seed can be generated. The only exception to the formula is that 
4^ f or m = 1 , 
> 

m" = 2 / m - delta 
Oo Where delta = 2(-8> 

So m" = 2 / m, and e" = (-e) - 1. 

Since IEEE exponents are represented in excess 1023 notation, a formula for X" must 
be determined, given that X is the IEEE exponent. As an IEEE exponent, 
X = e + 1023 -e = X - 1023 and X" = e" + 1023. So, for -X" in terms of X, 

X" = e" + 1023 

= (-e) - 1 +1023 
= (- (X - 1023)) + 1022 
= 1023 - X + 1022 
= 2045 - X 

So given the 1 1 bits of X as address of the seed exponent, the value stored at address 
X is 

X" = 2045 - X (2) 
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Given that the mantissa seed ROM uses 1 0 bits of the mantissa to determine the seed, 
each seed Xm will be used for some range of mantissas, Bm to (Bm + 2 * delta). 
The formula for Xm is from formula (1). 

2/Bm — Xm 

2/(Bm + 2 * delta) Xm 

Where delta = 2("H) 

This value is used since the actual Xm should be generated by the mantissa in the 
center of the given range: 

Xm = 2/(Bm + delta) 

This would result in a more accurate seed on the average. Therefore, the formula used 
to generate the mantissa part of the seed is 

Xm = 2/(Bm + (2(-H))) (3) 

Square Root PROMs 

The seed for the square root, XO, is actually the reciprocal of the square root of the 
data, B: 

XO = 1 / (BH/2)) 

Given B = m * (2 e ) and XO = m' * (2 e '), the expression for XO can be evaluated 
by substitution and reduction: 

XO = 1 / ((m * (2e))H/2)) £ 
= 1 Mm<!/2) * ( 2 (e/2))) £> 
= m (-1/2) * (2(~e/2)) 00 

Then m' and e' may be written as m' = m^~''/2) anc j e ' - -e/2. ^ 

Next, it is necessary to verify that the above m' and e' form a valid normalized IEEE 
number. When e is an odd number, e' is not an integer and, therefore, it is not valid 2 
IEEE exponent. If the above expression is separated into two cases, e' can be ^ 
represented in terms of a valid IEEE exponent, e": 

e' = - e/2 for e even 

e' = e" + 1/2 for e odd 

Rewriting e" in terms of e produces this expression: 

e" = e' - 1/2 = (-e/2) - 1/2 for e odd 

Then a valid IEEE exponent, e" , can be written for all e as 

e" = -e/2 for e even 

e" = (-e/2) - 1/2 for e odd 



5-127 



This is equivalent to e" = int( - e/2) for all e. However, the 1/2 affects the mantissa: 
XO = m' * (2®') 

XO = m' * (2(e" + V2)). for odd e 

XO = m' * (21/2) * (2e") for odd e 

Since XO = m" * (2 e ") m" can be rewritten as 

m" = m' for even e 

m" = m' * (2 1/2) for odd e 

In terms of m, m" = m~~1/2 f or even e 

m" = ( m -1/2j * (21/2) for odd e 

Simplifying m" for odd e, 

m" = (1/m1/2) * (21/2) for odd e 

m" = (2/m1/2) for odd e 

Just as the divide exponent needed to be converted to excess 1 023 notation, so the 
same must be done for the square root: 

X" = e" + 1023 
X = e + 1023 
X" = int(-e/2) + 1023 
X" = int((1023-X) / 2) + 1023 

m The IEEE bits for the exponent seed, X", can be expressed in terms of the IEEE bits 
2 for the exponent of B, X: 

£ X" = int( (1023-X) / 2 ) + 1023 

■£j Because the formula for m" depends on the least significant bit of e, that bit must 

00 be used as an address line to the mantissa. 

00 

W Since X = e + 1 023, an odd value of e will result in an even value of X, and an even 
^ value of e will result in an odd value of X. Therefore, 

m" = m-1/2 for odd X 

m" = 2/m 1 /2 for even X 
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SN74ACT8818 16-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 




SN74ACT8836 32- x 32-Bit Parallel Multiplier 




SN74ACT8837 64-Bit Floating Point Processor 



SN74ACT8841 Digital Crossbar Switch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 




Support 



Mechanical Data 
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SN74ACT8841 
Digital Crossbar Switch 

The SN74ACT8841 is a single-chip digital crossbar switch that cost-effectively 
eliminates bottlenecks to speed data through complex bus architectures. 

The 'ACT8841 has 16 four-bit bidirectional ports which can be connected in 
any conceivable combination. Total time for data transfer is 1 4-ns flowthrough. 

The 'ACT8841 is ideal for multiprocessor application, where memory bottlenecks 
tend to occur. For example, four 32-bit buses can be easily connected by two 
'ACT8841 devices. System architectures based on the 16-port 'ACT8841 can 
include up to 1 6 switching nodes (i.e., processors, memories, or bus interfaces). 
Larger processor arrays can be built with multistage interconnect schemes. 
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SN74ACT8841 
DIGITAL CROSSBAR SWITCH 



GB PACKAGE 
(TOP VIEW) 



• High-Speed Programmable Switch for 
Parallel Processing Applications 

• Dynamically Reconfigurable for Fault- 
Tolerant Routing 

• 64 Bidirectional Data l/Os in 16 Nibble 
(Four-Bit) Groups 

• Data I/O Selection Programmable by Nibble 

• Eight Banks of Control Flip-Flops for Storing 
Configuration Programs 

• Two Selectable Hard-Wired Switching 
Configurations 

• Selectable Stored-Data or Real-Time Inputs 

• 1 56-Pin Grid-Array Package 

• CMOS 1 fim EPIC™ Process 

• Single 5-V Power Supply 

description 

The SN74ACT8841 is a flexible, high-speed digital crossbar switch. It is easily microprogrammable to 
support user-definable interconnection patterns. This crossbar switch is especially suited to multiprocessor 
interconnects that are dynamically reconfigurable or even reprogrammable after each system clock. The 
'ACT8841 is built in Texas Instruments advanced 1 /xm EPIC CMOS process to enhance performance 
and reduce power consumption. The switch requires only a 5-V power supply. 

Because the 'ACT8841 is a 16-port device, system architectures based on the 'ACT8841 can include 
up to 16 switching nodes, which may be processors, data memories, or bus interfaces. Larger processor 
arrays can be built with multistage interconnection schemes. Most applications will use the crossbar switch 
as a broadband bus interface controller, for example, between closely coupled processors which must 
exchange data with very low propagation delays. 

The 'ACT8841 has ten selectable control sources, including eight banks of programmable control flip-flops 
and two hard-wired control circuits. The device can switch from 1 to 16 nibbles (4 to 64 bits) of data 
in a single cycle. 

The 64 I/O pins of the 'ACT8841 are arranged in 1 6 switchable nibbles (see Figure 1). A single input nibble 
can be broadcast to any combination of 1 5 output nibbles, or even to 1 6 nibbles (including itself) if operating 
off registered data. Multiple input nibbles can be switched to multiple outputs, depending on the programmed 
configurations available in the control flip-flops. 

The digital crossbar switch is intended primarily for multiprocessor interconnection and parallel processing 
applications. The device can be used to select and transfer data from multiple sources to multiple 
destinations. Since it can be dynamically reprogrammed, it is suitable for use in reconfigurable networks 
for fault-tolerant routing. 
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SN74ACT8841 

DIGITAL CROSSBAR SWITCH 



description (continued) 

The 'ACT8841 and the bipolar SN74AS8840 share the same architecture. Microcode for the 'AS8840 
can be run on the 'ACT8841 if the additional control inputs to the 'ACT8841 are properly terminated. 
However, because the 'ACT8841 is a CMOS device with six additional control inputs, the 'AS8840 and 
the 'ACT8841 are not socket-compatible and cannot be used interchangably. A summary of the differences 
between the SN74AS8840 and the SN74ACT8841 is provided in the 'AS8840 and 'ACT8841 
FUNCTIONAL COMPARISON at the end of the data sheet. 

The SN74ACT8841 is characterized for opertion from 0°C to 70°C. 



Table 1. ACT8841 Pin Grid Allocation 



NO. 


3 IN 

NAME 


NO. 


PIN 

NAME 


NO. 


PIN 

NAME 


NO. 


PIN 

NAME 


A1 


GND 


CIO 


D31 


H12 


V CC 


N7 


CNTR13 


A2 


GND 


en 


OED6 


H13 


LSCLK 


N8 


CREADO 


A3 


D37 


C12 


V CC 


H14 


SELDLS 


N9 


V CC 


A4 


D35 


C13 


GND 


H15 


CNTR3 


N10 


DO 


A5 


D33 


C14 


D23 


J1 


0"EC 


N1 1 


D3 


A6 


We 


C15 


D21 


J2 


CRWRITEO 


N12 


D6 


A7 


CRADR1 


D1 


D43 


J3 


CRWRITE1 


N13 


GND 


A8 


CNTR7 


D2 


D42 


J4 


GND 


N14 


D8 


A9 


CNTR4 


D3 


V CC 


J12 


GND 


N15 


D9 


A10 


0TED7 


D7 


GND 


J13 


CNTR2 


P1 


GND 


A11 


D29 


D8 


V CC 


J14 


CNTR1 


P2 


GND 


A12 


D27 


D9 


GND 


J15 


CNTRO 


P3 


D56 


A13 


D25 


D13 


D22 


K1 


CRWRITE2 


P4 


D58 


A14 


GND 


D14 


D20 


K2 


OED12 


P5 


D60 


A15 


GND 


D15 


D19 


K3 


D48 


P6 


D62 


B1 


GND 


E1 


D45 


K13 


D15 


P7 


CNTR12 


B2 


GND 


E2 


D44 


K14 


D14 


P8 


CNTR15 


B3 


D39 


E3 


6ED10 


K15 


OED3* 


P9 


TPO 


B4 


D36 


E13 


SED5 


L1 


D49 


P10 


OEDO 


B5 


D34 


E14 


D18 


L2 


D50 


P1 1 


D2 


B6 


OBD8 


E15 


D17 


L3 


01515 


P12 


D4 


B7 


CRADRO 


F1 


0ED1 1 


LI 3 


OED2 


P13 


D7 


B8 


CRSRCE 


F2 


D46 


L14 


D12 


P14 


GND 


B9 


CNTR5 


F3 


D47 


L15 


D13 


P15 


GND 


B10 


D30 


F13 


D16 


M1 


D51 


R1 


GND 


B11 


D28 


F14 


0ED4 


M2 


D52 


R2 


GND 


B12 


D26 


F15 


CRSEL3 


M3 


D54 


R3 


D57 


B13 


D24 


G1 


CNTR8 


M7 


GND 


R4 


D59 


B14 


GND 


G2 


CNTR9 


M8 


v cc 


R5 


D61 


B15 


GND 


G3 


CNTR10 


M10 


GND 


R6 


OED15 


C1 


D41 


G4 


GND 


M13 


vcc 


R7 


CNTR14 


C2 


D40 


G12 


GND 


M14 


D10 


R8 


CREAD1 


C3 


GND 


G13 


CRSEL2 


M15 


D11 


R9 


CREAD2 


C4 


D38 


G14 


CRSEL1 


N1 


D53 


R10 


TP1 


C5 


5ED9 


G15 


CRSELO 


N2 


D55 


R11 


D1 


C6 


D32 


H1 


CNTR11 


N3 


GND 


R12 


OEDT 


C7 


v C c 


H2 


SELDMS 


N4 


v cc 


R13 


D5 


C8 


CRCLK 


H3 


MSCLK 


N5 


0ED14 


R14 


GND 


C9 


CNTR6 


H4 


v C c 


N6 


D63 


R15 


GND 
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DIGITAL CROSSBAR SWITCH 



Table 2. ACT8841 Pin Functional Description 



PIN 


N ° 


I/O 


DESCRIPTION 


CNTRO 


J15 






CNTR1 


J14 






GNTR2 


J13 














CNTR4 


A9' 5 






CNTR5 








CNTR6 


C9 






CNTR7 


A8 


I/O 


Control I/O. Inputs four control words to tn6 control flip~flops on ©sen CHCLK cyci©. As outputs, tn© 


CNTR8 


G1 


same addresses can be used to read the flip-flop settings. 


CNTR9 


G2 






CNTR1 0 


G3 














CNTR12 


P7 






CNTR1 3 








CNTR1 4 


R7 






CNTR1 5 


P8 






CRADRO 


87 


I 


Control register address. Selects 16-bits of control flip-flops as a source/destination for outputs/inputs 


CRADR1 


A7 




on CNTR0-CNTR1 5. (see Table 7) 


CRCIK 


C8 


I 


Control register clock. Clocks CNTR0-CNTR15 into the control flip-flops on low-to-high transition. 


CREADO 
CREAD1 
CREAD2 


N8 
R8 
R9 


I 


Selects one of eight banks of control flip-flops to read out on CNTR0-CNTR15 in 16-bit words 
addressed by CRADR1 -CRADRO. 


CRSELO 


G15 






CRSEL1 
CRSEL2 


G14 
G13 


1 


Selects one of ten control configurations. 


CRSEL3 


F15 






CRSRCE 


B8 


I 


Load source select. When low selects CNTR inputs, when high selects DATA inputs. 
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SN74ACT8841 

DIGITAL CROSSBAR SWITCH 



Table 2. ACT8841 Pin Functional Description (continued) 



PIN 
NAME 


NO. 


I/O 


DESCRIPTION 


CRWRITEO 


J2 






CRWRITE1 


J3 


I 


Destination select. Selects one of eight control banks, (see Table 4) 


CRWRITE2 


K1 






DO 


N10 






D1 


R1 1 






D2 


P1 1 






D3 


N1 1 






D4 


P12 






D5 


R13 






D6 


N12 






D7 


P13 






D8 


N14 






D9 


N15 






D10 


M14 






D1 1 


M15 






D12 


L14 






D13 


L15 






D14 


K14 






D15 


K13 






D16 


F13 


I/O 


I/O data bits 0 through 31 (data bits 0 through 31 are the least significant half). 


D17 


E15 






D18 


E14 






D19 


D15 






D20 


D14 






D21 


C1 5 






D22 


D13 






D23 


C14 






D24 


B1 3 






D25 


A13 






D26 


B12 






D27 


A12 






D28 


B11 






D29 


A11 






D30 


B10 






D31 


C10 






D32 


C6 






D33 
D34 


A5 
B5 


I/O 


I/O data bits 32 through 35 (data bits 32 through 63 are the most significant half). 


D35 


A4 
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SN74ACT8841 
DIGITAL CROSSBAR SWITCH 



Table 2. ACT8841 Pin Functional Description (continued) 



NAME 


NO. 


D36 


B4 


037 


A3 


D38 


C4 


039 


B3 


D40 


C2 


041 


C1 


D42 


02 


043 


D1 


044 


E2 


D45 


E1 


D46 


F2 


D47 


F3 


D48 


K3 


D49 


L1 


050 


L2 


D51 


Ml 


052 


M2 


D53 


N1 


D54 


M3 


D55 


N2 


D56 


P3 


057 


R3 


D58 


P4 


059 


R4 


060 


P5 


061 


R5 


062 


P6 


D63 


N6 


GND 


A1 


GND 


A2 


GND 


A14 


GND 


A15 


GND 


B1 


GND 


B2 


GND 


B14 


GND 


B15 


GND 


C3 


GND 


C13 


GND 


07 


GND 


D9 


GND 


G4 


GND 


G12 



DESCRIPTION 



I/O data bits 36 through 63 (data bits 32 through 63 are the most significant half). 



Ground (all pins must be used). 
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SN74ACT8841 

DIGITAL CROSSBAR SWITCH 



Table 2. ACT8841 Pin Functional Description (continued) 



PIIN 

NAME 




I/O 


D t SCRIPT IUiM 


NO 




GND 


J4 






GND 


J12 * 






GND 


M7 






GND 


M10 






GND 


N3 






GND 


N13 






GND 


P1 






GND 


P2 




Ground (all pins must be used). 


GND 


P14 






GND 


P15 






GND 


R1 






GND 


R2 






GND 


R14 






GND 


R15 






LSCLCK 


H13 


I 


Clocks the least significant half of data inputs into the input registers on a low-to-high transition. 


MSCLK 


H3 


I 


Clocks the most significant half of data inputs into the input registers on a low-to-high transition. 


OEC 


J1 


1 


Output enable for control flip-flops, active low 


OEDO 


P10 






OED1 


R12 






OED2 


L13 






OED3 


K15 






0ED4 


F14 






OED5 


E13 






OED6 


C11 






OED7 


A10 






OED8 


B6 


1 


Output enables for data nibbles, active low 


OED9 


C5 






OED10 


E3 






OED11 


F1 






OED12 


K2 






UED13 


L3 






0ED14 


N5 






OED15 


R6 
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Table 2. ACT8841 Pin Functional Description (concluded) 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


SELDLS H14 


I 


When low, selects the stored, least significant data input to the main internal bus. When high, real- 
time data is selected. 


SELDMS H2 


I 


When low, selects the stored, most significant data input to the main internal bus. When high, real- 
time data is selected. 


TPO P9 
TP1 R10 


I 


Test pins. High during normal operation, (see Table 9) 


V C C C7 
V C C C12 
V C C D3 
VCC °8 
V C C "4 
V CC 'HI 2 
V C C M8 
V C C M13 
V C C N4 
Vcc N9 




5-V supply 


WE A6 


I 


Write enable for control flip-flops, active low 



overview 

The 64 I/O pins of the 'ACT8841 are arranged in 16 nibble (four-bit) groups where each set of four pins 
serves as bidirectional inputs to and outputs from a nibble multiplexer. During a switching operation, each 
nibble passes four bits of either stored or real-time data to the main internal 64-bit data bus. Each output 
multiplexer will independently select one of the 16 nibbles from this 64-bit data bus. 

Data nibbles are organized into two groups: the least significant half (D31-D0) and the most significant 
half (D63-D32). Stored versus real-time data inputs can be selected separately for the LSH and the MSH. 
Two clock inputs, LSCLK and MSCLK, are available to latch LSH and MSH data inputs, respectively, into 
the data register. 

The pattern of output nibbles resulting from the switching operation is determined by a selectable control 
source, either one of eight banks of programmable control flip-flops or one of two hard-wired switching 
configurations. Inputs to the control flip-flops can be loaded either from the data bus or from control l/Os. 
A separate clock (CRCLK) is provided for loading the banks of control flip-flops. 
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SN74ACT8841 

DIGITAL CROSSBAR SWITCH 
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logic symbol 



CO 
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o 

H 
00 
00 



OEC- 

CNTR3 CNTR0 - 
CNTR7 CNTR4 - 
CNTR1 1 -CNTR8 - 
CNTR15 CNTR12- 

LSCLK - 
SELDLS - 



DIGITAL CROSSBAR SWITCH 
* 

ACT8841 



WRITE EN 
REG BANK 



DESTINATION 
SOURCE 



EN 




0 






CONTROL 




V 


15 





>CLK 
SELECT 



OEDO 

D3 DO-v^ 



MUX 
-N EN 

0 IV 



D7 D4-^4> 



OED2- 
D11 D8- 



D19 D16~/^ 



D27 D24-/^ 



OED7 - 



D31 D28 —/+ 



MUX 
EN 



MUX 
EN 

8 |V 
111 



MUX 
EN 

izly 

151 



MUX 
EN 

161 V 
191 



MUX 
EN 

201 y 
23 1 



MUX 
EN 

2 4! S7 
271 



MUX 
EN 

281 V 
3l! 



CONTROL 
REGISTER 



SELECT 

ADDRESS 
TEST 



^ DATA ^ 



CLK < 
SELECT 

c 



- WE 

- CREADO 

- CREAD1 

- CREAD2 

- CRWRITEO 

- CRWRITE1 

- CRWRITE2 

- CRSRCE 

- CRSELO 
- CRSEL1 

- CRSEL2 

- CRSEL3 
- CRADR0 
-CRADR1 

- TP0 

- TP1 

- MSCLK 

- SELDMS 



MUX 
EN 

V |32 
I35 



MUX 



V |36 

, I39 



MUX 
EN 

S7 |40 
I43 



MUX 
EN 

V |44 
I47 



MUX 
EN 

V I48 
151 



MUX 
EN 

y i52 

1 55 



MUX 
EN 

V |56 
I59 



MUX 
EN 

V |60 
I63 



■ OED8 
- D35 D32 



OED9 
D39 D36 



- OED10 

- D43 D40 



-OED12 
- D51 D48 



-OED13 
- D55 D52 



-0ED14 
- D59 D56 



-OED15 
- D63 D60 



FIGURE 1 
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architecture 

The 'ACT8841 digital crossbar switch has its 64 data l/Os arranged in 1 6 multiplexer logic blocks, as shown 
in Figure 2. Each nibble multiplexer logic block handles four bits of real-time input and four bits of stored- 
data input, and either input can be passed to the common data bus. 

Two input multiplexer controls are provided to select between stored and real-time inputs. SELDLS controls 
input data selection for the LSH (D31-D0) of the 64-bit data input, and SELDMS for the MSH (D63-D32). 
The input register clocks, LSCLK and MSCLK, are grouped in the same way and are used to clock data 
into the registers in the multiplexer logic blocks. The 16 data input nibbles make up the 64 data bits on 
the internal main bus. 

This common bus supplies 1 6 data nibbles to a 1 6-to-1 output multiplexer in each multiplexer logic block 
(see Figure 3). As determined by one of ten selectable control sources, the 16-to-1 output multiplexer 
selects a data nibble to send to the outputs via the three-state output driver. 

Control of the input and output multiplexers determines the input-to-output pattern for the entire crossbar 
switch. Many different switching combinations can be set up by programming the control flip-flop 
configurations to determine the outputs from the 16-to-1 multiplexers. 

For example, the switch can be programmed to broadcast one data input nibble through the other 1 5 nibbles 
(60 outputs). Conversely, a 15-to-1 nibble multiplexer can be configured by programming the switch to 
select and output a single data nibble from the 64-bit bus. Several examples are described in more detail 
in a later section. 
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functional block diagram 



D15 012 - 
5153 - 

019 D16 - 
5154- 



D51 048 
0ED12 




CNTR11 CNTR7 

CNTR8 CNTR4 
CNTR15- CNTR3- 
CNTR12 CNTRO 



FIGURE 2 
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multiplexer logic group 

There are 16 multiplexer logic blocks, one for each nibble. External data flows from four data I/O pins 
into a logic block. A block diagram of the multiplexer logic is shown in Figure 3. The data inputs are either 
clocked into the data register or passed directly to the main internal bus. The 64 bits of data from the 
main bus are presented to a 16-to-1 multiplexer, which selects the data nibble output. 

Each of the 16 nibble multiplexer logic blocks contains eight control flip-flop (CF) groups, one for each 
of the control banks. A control bank stores one complete switching configuration. Each CF group consists 
of four D-type edge-triggered flip-flops. In Figure 3, the CF groups are shown as CFXXO to CFXX7, where 
XX indicates the number of the nibble multiplexer logic group (0 < = XX < = 1 5). CFXXO represents the 
1 6 CF groups (one from each logic block) which make up flip-flop control bank 0, CFXX1 the 1 6 CF groups 
in bank 1 , etc. 

In addition to the eight banks of programmable flip-flops, two hard-wired switching configurations can 
be selected. The MSH/LSH exchange directs the input nibbles from each half of the switch to the data 
outputs directly opposite. This switching pattern is shown in Table 3 below. For example, data input on 
D1 1-D8 is output on D43-D40, and data input on D43-D40 is output on D1 1-D8. 

Table 3. MSH/LSH Exchange 



LSH 




MSH 


D3-D0 




D35-D32 


D7-D4 




D39-D36 


D11-D8 




D43-D40 


D15-D12 




D47-D44 


D19-D16 




D51-D48 


D23-D20 




D55-D52 


D27-D24 




D59-D56 


D31-D28 




D63-D60 



The second hard-wired configuration, a read-back function, causes all 64 bit to be output on the same 
l/Os on which they were input. Neither of the hard-wired control configurations affects the contents of 
the control banks. 

The control source select, CRSEL3-CRSEL0, determines which switching pattern is selected, as shown 
in Table 4. 



Table 4. 16-to-1 Output Multiplexer Control Source Selects 



CO 

z 

> 

o 

H 

00 
00 
4* 



CRSEL3 


CRSEL2 


CRSEL1 


CRSEL0 


CONTROL SOURCE SELECTED 


L 


L 


L 


L 


Control bank 0 


(programmable) 


L 


L 


L 


H 


Control bank 1 


(programmable) 


L 


L - 


H 


L 


Control bank 2 


(programmable) 


L 


L 


H 


H 


Control bank 3 


(programmable) 


L 


H 


L 


L 


Control bank 4 


(programmable) 


L 


H 


L 


H 


Control bank 5 


(programmable) 


L 


H 


H 


L 


Control bank 6 


(programmable) 


L 


H 


H 


H 


Control bank 7 


(programmable) 


H 


X 


X 


L 


MSH/LSH exchange* 


H 


X 


X 


H 


Read-back (output echoes input)* 



•Hard-wired switching configuration 
X = don't care 



Texas ^ 
Instruments 

POST OFFICE BOX 655012 • DALLAS. TEXAS 75265 



6-16 



SN74ACT8841 
DIGITAL CROSSBAR SWITCH 



control words 

A CF group can store a four-bit control word (CFN3-CFN0) to select the output of the 1 6-to-1 multiplexer 
for that nibble port. One control word is loaded in each CF group. A total of 16 words, one per multiplexer 
logic block, are loaded in a bank to configure one complete switching pattern. Table 5 lists the control 
words and the input data each selects. 

Each control word can be stored in a CF group and sent as an internal control signal to select the output 
of a 1 6-to-1 multiplexer in a nibble logic block. For example, any CF group loaded with the word "LHHH" 
will select the data input on D31-D28 as the outputs of the associated nibble. If all 16 CF groups in a 
bank were loaded with "LHHH," the same output (D31-D28) would be selected by the entire switch. 



Table 5. 16-to-1 Output Multiplexer Control Words 





INTERNAL SIGNALS 




INPUT DATA SELECTED AS 


CFN3 


CFN2 


CFN1 


CFNO 


MULTIPLEXER OUTPUT 


L 


L 


L 


L 


D3-D0 


L 


L 


L 


H 


D7 D4 


L 


L 


H 


L 


D11-D8 


I 


L 


H 


H 


D15-D12 


L 


H 


L 


L 


D19-D16 


L 


H 


L 


H 


D23-D20 


L 


H 


H 


L 


D27-D24 


L 


H 


H 


H 


D31-D28 


H 


L 


L 


L 


D35-D32 


H 


L 


L 


H 


D39-D36 


H 


L 


H 


L 


D43-D40 


H 


L 


H 


H 


D47-D44 


H 


H 


L 


L 


D51-D48 


H 


H 


L 


H 


D55-D52 


H 


H 


H 


L 


D59-D56 


H 


H 


H 


H 


D63-D60 



loading control configurations 

CRWRITE2-CRWRITE0 select which control bank is being loaded, as shown in Table 6. 



Table 6. Control Flip-Flops Load Destination Select 



CRWRITE2 


CRWRITE1 


CRWRITEO 


DESTINATION 


L 


L 


L 


Control bank 0 


L 


L 


H 


Control bank 1 


L 


H 


L 


Control bank 2 


L 


H 


H 


Control bank 3 


H 


L 


L 


Control bank 4 


H 


I 


H 


Control bank 5 


H 


H 


L 


Control bank 6 


H 


H 


H 


Control bank 7 
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The control words for a bank can be loaded either 1 6 bits at a time on the control I/O pins (CNTR1 5-CNTRO) 
or all 64 bits at once on the data inputs (D63-D0). If the control load source select, CRSRCE, is high, the 
words are loaded from the data inputs. When CRSRCE = L, the CNTR inputs are used. 

When a control bank is loaded from the data inputs, WE, CRSRCE, CRWRITE2-CRWRITEO, and the control 
register clock CRCLK are used in combination to load all 16 control words (64 bits) in a single cycle. A 
MSH/LSH exchange like that shown in Table 3 is used to load the flip flops on a rising CRCLK clock edge. 
For example, data inputs D3-D0 go to the data bus and then to the CF group that selects the data outputs 
for D35-D32. CRWRITE2-CRWRITEO select the control bank that is loaded (see Table 6). 

The CNTR1 5-CNTRO inputs can also be used to load the control banks. The bank is selected by 
CRWRITE2-CRWRITE0 (see Table 6). Four control words per CRCLK cycle can be input to the CF groups 
(CFXX) that make up the bank. The CF groups loaded are selected by CRADR1-CRADR0, as shown in 
Table 7. Four CRCLK cycles are needed to load an entire control bank. 

Table 7. Loading Control Flip-Flops from CNTR l/Os 



CRAD1 


CRADO 


WE 


CRCLK 


CF GROUPS LOADED BY 
CONTROL (CNTR) I/O NUMBERS 










15-12 


11-8 


7-4 


3-0 


L 


L 


L 


S 


CF12 


CF8 


CF4 


CFO 


L 


H 


L 


r 


CF13 


CF9 


CF5 


CF1 


H 


L 


L 


S 


CF14 


CF10 


CF6 


CF2 


H 


H 


L 


J" 


CF15 


CF1 1 


CF7 


CF3 


X 


X 


H 


X 


Inhibit write to flip-flops 



To read out the c ontrol settings, the same address signals can be used, except that no CRCLK signal is 
needed and OEC is pulled low. CREAD2-CREAD0 select the bank to be read; the format is the same as 
for CRWRITE2-CRWRITE0, shown in Table 6. 

Using the control l/Os to read the control bank settings can be valuable during debugging or diagnostics. 
Control settings are volatile and will be lost if the 'ACT8841 is powered off. An external program controlling 
switch operation may need to read the control bank settings so that it can save and restore the current 
switching configurations. 



test pins 
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TP1-TP0 test pins are provided for system testing. As Table 8 shows, these pins should be maintained 
high during normal oper ati on. T o for ce all outputs and l/Os low, low signals are placed on TP1-TP0 and 
all output enables (OED1 5-OEDO and OEC). To force all outputs and l/Os high, TP1 and all output enables 
are pulled low, and TPO is driven high. When TPO is left low and a high signal is placed on TP1 , all outputs 
on the 'ACT8841 are placed in a high-impedance state, isolating the chip from the rest of the system. 

Table 8. Test Pin Inputs 





OED15- 






TP1 TPO 


OED0 


OEC 


RESULT 


L L 


L 


L 


All outputs and l/Os forced low 


L H 


L 


L 


All outputs and l/Os forced high 


H L 


X 


X 


All outputs placed in a high-impedance state 


H H 


X 


X 


Normal operation (default state) 
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examples 

Most 'ACT8841 switch configurations are straightforward to program, involving few control signals and 
procedures to set up the control words in the banks of flip-flops. Control signals and procedures for loading 
and using control words are shown in the following examples. 

broadcasting a nibble 

Any of the 16 data input nibbles can be broadcast to the other 15 data nibbles for output. For ease of 
presentation, input nibble D63-D60 is used in this example. Example 1 presents the microcode sequence 
for loading flip-flop bank 0 and executing the nibble broadcast. 

The low signal on CRSRCE selects CNTR15-CNTR0 as the input source, and the low signals on 
CRWRITE2-CRWRITE0 select flip-flop bank 0 as the destination. Table 5 shows that to select data on 
D63-D60 as the output nibble, the four bits in the control word CFN3-CFN0 must be high; therefore the 
CNTR1 5-CNTRO inputs are coded high. The four microcode instructions shown in Example 1 load the same 
control word from CNTR1 5-CNTRO into all 16 CF groups of bank 0. 

Once the control flip-flops have been loaded, the switch can be used to broadcast nibble D63-D60 as 
programme d. T he microcode instruction to execute the broadcast is shown as the last instruction in 
Example 1 . WE is held high and the data to be broadcast is input on D63-D60. The high signal on SELDMS 
selects a real-time data input for the broadcast. MSCLK and LSCLK (not shown) can be used to load the 
input registers if the input nibble is to be retained. No register clock signals are needed if the input data 
is not being stored. 

The banks of control flip-flops not selected as a control source can be loaded with new control words 
or read out on CNTR1 5-CNTRO while the switch is operating. For example, the MSH data inputs can be 
used to load flip-flop bank 1 of the LSH while bank 0 of the LSH is controlling data I/O. 
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Example 1 . Programming a Nibble Broadcast 



INST. 
NO. 


CRSRCE 


CRWRITE2 


CR WRITE 1 


CRWRITEO 


CRADR1 


CRADRO 


CNTR I/O 
15-12 11-8 


NUMBE 
7-4 


AS 

3-0 


CRSEL3 


CRSEL2 


CRSEL1 


CRSELO 


WE 


SELDMS 


SELOLS 


OED15-OED0 


OEC 


CRCLK 




0 


0 


0 


0 


0 


0 


1111 1111 


1111 


1111 


X 


X 


X 


X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 






2 


0 


0 


0 


0 


0 




1111 1111 


1111 


1111 


X 


X 


X 


X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 




J 


3 


0 


0 


0 


0 


1 


0 


1111 1111 


1111 


1111 


X 


X 


X 


X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 




J 


4 


0 


0 


0 


0 




1 


1111 1111 


1111 


1111 


X 


X 


X 


X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 




r 


5 


X 


X 


X 


X 


X 


X 


XXXX XXXX 


XXXX 


XXXX 


0 


0 


0 


0 


1 




X 


1000 


0000 0000 


0000 




None 



INST. NO. 


COMMENT 


1 


Loads CF12. CF8, CF4, CFO of bank 0 


2 


Loads CF13. CF9, CF5, CF1 of bank 0 


3 


Loads CF14. CF10. CF6, CF2 of bank 0 


4 


Loads CF15, CF1 1. CF7, CF3 of bank 0 


5 


Selects bank 0 for switching control 




Selects real-time data inputs 



Example 2. Programming an MSH/LSH Exchange on CNTR Inputs 



INST. 
NO. 


CRSRCE 


CRWRITE2 CRWRITE1 CRWRITEO 


CRADR1 


CRADRO 


CNTR I/O 
15 12 118 


NUMBEI 
7 4 


re 

3-0 


CRSEL3 


CRSEL2 


CRSEL1 


CRSELO 


WE 


SELOMS 


SELOLS 


0ED15 0ED0 


OEC 


CRCLK 


1 


0 


1 1 1 


0 


0 


0100 0000 


1100 


1000 


X 


X 


X 




0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 






2 


0 


1 1 1 


0 


1 


0101 0001 


1101 


1001 




X 


X 


X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 






3 


0 


1 11 




0 


0111 0011 


1111 


1011 




X 


X 


X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 








0 








0111 0011 


1111 


1011 


X 


X 




X 


0 


X 


X 


XXXX 


XXXX XXXX 


XXXX 






5 


X 


XXX 


X 


X 


XXXX XXXX 


XXXX 


XXXX 


0 


1 


1 






0 


0 


0000 


0000 0000 


0000 




None 



Comments 



INST NO 


COMMENT 


1 


Loads CF12. 


CF8. CF4. CFO of bank / 


2 


Loads CF13. 


CF9. CF5. CF1 of bank 7 


3 


Loads CF14. 


CF10. CF6. CF2 of bank 7 




Loads CF 15. 


CF11. CF7. CF3 of bank 7 


5 


Selects bank 


7 for switching control 




Selects regis 


tered data inputs 
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programming an MSH/LSH exchange ^ 

A second, more complicated example involves programming the switch to swap corresponding nibbles \j£ 

between the MSH and the LSH (first nibble in the LSH for first nibble in the MSH, and so on). This swap ^ 

can be implemented using the hard- wired logic circuit selected when CRSEL3 is high and CRSELO is low. yj 

Programming this swap without using the MSH/LSH exchange logic requires loading a different control £C 

word into each mux logic block. This is described below for purposes of illustration. CL 

Each nibble in one half, either LSH or MSH, selects as output the registered data from the corresponding h" 

nibble in the other half. The registered data from D35-D32 is to be output on D3-D0, the registered data O 

from D3-D0 is output on D35-D32, and so on for the remaining nibbles. As shown in Table 4, the flip-flops |J3 

for D3-D0 have to be set to 1 000 and the D35-D32 inputs must be low. The CF groups and control words Q 

involved in this switching pattern are listed in Table 9. Q 

cc 

Table 9. Control Words for an MSH/LSH Exchange Q. 



CF 
GROUP 


CNTR INPUTS 


CONTROL 










TO LOAD 
FLIP-FLOPS 


WORD 
LOADED 


RESULTS 




CF15 




01 1 1 


D31-D28 




D63 


D60 


CF14 


CNTR15- 


0110 


D27-D24 




D59 


D56 


CF13 


CNTR12 


0101 


D23-D20 




D55 


D52 


CF12 




0100 


D19-D16 




D51 


D48 


CF1 1 




001 1 


D15-D12 




D47 


D44 


CF10 


CNTR1 1 - 


0010 


D11-D8 




D43 


D40 


CF9 


CNTR8 


0001 


D7-D4 




D39 


D36 


CF8 




0000 


D3-D0 




D35 


D32 


CF7 




1111 


D63-D60 




D31 


D28 


CF6 


GNTR7- 


mo. 


D59-D56 




D27 


D24 


CF5 


CNTR4 


1101 


055-D52 




D23 


D20 


CF4 




1 100 


D51-D48 




D19 


D16 


CF3 




1011 


D47-D44 




D15 


D12 


CF2 


CNTR3- 


1010 


D43-D40 




D11 


D8 


CF1 


CNTRO 


1001 


D39-D36 




D7 


D4 


CFO 




1000 


D35-D32 




D3 


DO 



With this list of control words and the signals in Table 7, the 16-bit control inputs on CNTR15-CNTR0 
can be arranged to load the control flip-flops in four cycles. Example 2 shows the microcode instructions 
for loading the control words and executing the exchange. 

In Example 2, bank 7 of flip-flops is being programmed. Bank 7 is selected by taking CRWRITE2-CRWRITE0 
high and leaving CRSRCE low (see Table 4) when the control words are loaded on CNTR1 5-CNTRO. With qq 
WE held low, the CRCLK is used to load the four sets of control words. Once the flip-flops are loaded, QO 
data can be input on D63-D0 and the programmed pattern of output selection can be executed. A 
microinstruction to select registered data inputs and bank 7 as the control source is shown as the last O 
instruction in Example 2. The data must be clocked into the input registers, using LSCLK and MSCLK, ^ 
before the last instruction is executed. 
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The control flip-flops could also have been loaded from the data input nibbles in one CRCLK cycle. Input 
nibbles from one half are mapped onto the control flip-flops of the other half. All control words to set up 
a switching pattern should be loaded before the bank of flip-flops is selected as control source. The 
microcode instructions to load bank 1 with the 16 control words in one cycle are presented in Example 3. 

Example 3. Loading the MSH/LSH Exchange from Data Inputs 



c/> 
z 

> 

O 
H 
00 
00 
4* 



CRSRCE 


CRWRITE2 


CRWRITE1 


CRWRITE0 


we 


SELDMS SELDLS 


9ID1 5(51150 


CRCLK 


1 


0 


0 


1 


0 


1 1 


1111 1.1*1 1111 1111 





These control nibbles may be loaded from the input as a 64-bit real-time input word or as two 32-bit words 
stored previously. To use stored control words, MSCIK and LSCLK are used to load the LSH and MSH 
input registers with the correct sequence of control nibbles. Whenever the flip-flops are loaded from the 
data inputs, all 64 bits of control data must be present when the CRCLK is used so that all control nibbles 
in a program are loaded simultaneously. Example 4 presents the three microcode instructions to load the 
MSH and LSH input registers and then to pass the registered data to flip-flop bank 2. 

Example 4. Loading Control Flip-Flops from Input Registers 



INST. 
NO. 


CRSRCE 


CRWRITE2 


CRWRITE1 


CRWRITE0 


WE 


SELDMS 


SELDLS 


Ol5l5- 
SlfJo 


CRCLK 


MSCLK 


LSCLK 


COMMENTS 


1 


X 


X 


X 


X 


1 


X 


X 


1 


None 


s 


None 


Load inputs 
D63-D32 


2 


X 


X 


X 


X 


1 


X 


X 


1 


None 


None 


_T 


Load inputs 
D31-D0 


3 


1 


0 


1 


0 


0 


0 


0 


1 


_r 


None 


None 


Load control 
bank 2 



The control words in a program can also be read back from the flip-flops using the CNTR outputs. Four 
instructio ns ar e necessary to read the 64 bits in a bank of flip-flops out on CNTR1 5-CNTR0. WE is held 
high and OEC is taken low. No CRCLK signal is required. CREAD2-CREAD0 select bank 2 of flip-flops, 
and CRADR1-CRADR0 select in sequence the four addresses of the 16-bit words to be read out on the 
CNTR outputs. Example 5 shows the four microcode instructions. 

Example 5. Reading Control Settings on CNTR Outputs 



INST. 
NO. 


CREAD2 CREAD1 CREAD0 


OEC 


CRADR1 CRADR0 


WE 


CNTR I/O NUMBERS 
15-12 11-8 7-4 3-0 


COMMENT 


1 
2 
3 
4 


0 1 0 
0 10 
0 1 0 
0 1 0 


0 
0 
0 
0 


0 0 

0 1 

1 0 

1 1 


1 
1 
1 
1 


0100 0000 1100 1000 

0101 0001 1101 1001 

0110 0010 1110 1010 

0111 0011 1111 1011 


Read CF12, CF8, CF4, CFO 
Read CF13, CF9, CF5. CF1 
Read CF14, CF10, CF6, CF2 
Read CF15, CF11, CF7, CF3 
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SN74ACT8841 
DIGITAL CROSSBAR SWITCH 



absolute maximum ratings over operating free-air temperature range (unless otherwise noted) 1 ' 



Supply voltage, Vqc • -0.5 V to 6 V 

Input clamp current, l|K (V| < 0 or V| > V<x) •••••• ±20 mA 

Output clamp current, Iok < v 0 < 0 or Vo > Vcc> • ±50 mA 

Continuous output current, lo (Vo = 0 to VcC) ±50 m A 

Continuous current through Vcc or GND pins ±100 mA 

Operating free-air temperature range 0°C to 70 °C 

Storage temperature range . . -65°C to 150°C 



Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 
only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 



recommended operating conditions 



PARAMETER 


MIN NOM 


MAX 


UNIT 


vcc 


Supply voltage 


4.5 5.0 


5.5 


V 


V|H 


High-level input voltage 


2 


v C c 


V 


V|L 


Low-level input voltage 


0 


0.8 


V 


•oh 


High-level output current 


-8 


mA 


lOL 


Low-level output current 


8 


mA 


V| 


Input voltage 


o 


vcc 


V 


v 0 


Output voltage 


0 


vcc 


V 


dt/dv 


Input transition rise or fall rate 


0 


15 


ns/V 


T A 


Operating free-air temperature 


0 


70 


°C 



electrical characteristics over recommended operating free-air temperature range (unless otherwise 
noted) 



PARAMETER 


TEST CONDITIONS 


vcc 


T A - 25°C 


MIN TYP MAX 


UNIT 


MIN TYP MAX 


VOH 


l 0 H = - 20 jtA 


4.5 V 




4.4 


V 


5.5 V 




5.4 


lOH - 8 mA 


4.5 V 


3.8 


3.7 


5.5 V 


4.8 


4.7 


vol 


l 0 L = 20 (iA 


4.5 V 




0.1 


V 


5.5 V 






'0L = 8 mA 


4.5 V 


0.32 


0.4 


5.5 V 


0.32 


0.4 


ioz 


v 0 = V CC or 0 


5 V 


±0.5 


±0.5 


MA 


it 


V| = Vcc or 0 


5.5 V 


0.1 


± 1 


mA 


•cc 


V| = V C C or 0, lo 


5.5 V 




100 


^A- 


C| 


V| = Vcc or 0 


5 V 






pF 



^This is the increase in supply current for each input that is at one of the specified TTL voltage levels rather than 0 V or Vcc- 
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SN74ACT8841 

DIGITAL CROSSBAR SWITCH 



switching characteristics over recommended ranges of supply voltage and operating free-air temperature 
(unless otherwise noted) 



PARAMETER 


FROM 


TO 


MIN TYPt 


MAX 


UNIT 




Data in 




7 


14 






MSCLK, LSCLK 




10 


18 






SELDMS, SELDLS 


Data out 


9 


15 






CRCLK 




12 


19 




tpd 


CRSEL3-CRSEL0 




12 


19 


ns 




CREAD2-CREAD0 




10 18 






CRCLK 


CNTRn 


10 


18 






CRAD1 , CRADO 




8 


16 






TP1 , TPO 


All outputs 


10 


19 






TP1, TPO 


All outputs 


10 


15 




t en 


OED 


Data out 


7 


12 


ns 




OEC 


CNTRn 


8 


14 






TP1.TP0 


All outputs 


10 


15 




tdis 


OED 


Data out 


5 


8 


ns 




OEC 


CNTRn 


6 10 





t AII typical values are at VCC = 5 V, TA = 25 °C. 



timing requirements over recommended ranges of supply voltage and operating free-air temperature 
(unless otherwise noted) 



PARAMETER 


MIN MAX 


UNIT 


t w Pulse duration 


LSCLK, MSCLK, CRCLK high or low 


7 


ns 


t su Setup time before CRCLK 


Data 


7 


ns 


CNTRn 


7 


SELDMS, SELDLS 


9 


CRADR1 .CRADRO 


8 


CRSRCE, CRWRITE2-CRWRITE0 


8 


LSCLK, MSCLK 


10 


WE 


8 


t su Setup time, data before LSCLK or MSCLK 


7 


ns 


t n Hold time after CRCLK 


Data 


0 


ns 


CNTRn 


0 


SELDMS, SELDLS 


0 


CRADR1, CRADRO 


0 


CRSRCE, CRWRITE 


0 


We 


0 


t n Hold time, data after LSCLK or MSCLK 


0 


ns 



Texas ^ 
Instruments 

POST OFFICE BOX 655012 • DALLAS. TEXAS 75265 



6-24 



SN74ACT8841 
DIGITAL CROSSBAR SWITCH 



AS8840 AND ACT8841 FUNCTIONAL COMPARISON 

differences between the SN74AS8840 and the SN74ACT8841 

The SN74AS8840 and the SN74ACT8841 digital crossbar switches essentially perform the same function. 
The SN74AS8840 and the SN74ACT8841 are based on the same 1 6-port architecture, differing in the 
number of control registers, power consumption, and pin-out. 

One difference is in the number of programmable control flip-flop banks available to configure the switch. 
The 'AS8840 has two programmable control banks, while the 'ACT8841 has eight. Both have two 
selectable hard-wired switching configurations. 

The increased number of control banks in the 'ACT8841 require six additional pins not found on the 
'AS8840. These are: CRWRITE2, CRWRITE1, CREAD2, CREAD1, CRSEL3, and CRSEL2. CREAD and 
CRWRITE on the '8840 become CREADO and CRWRITEO on the '8841 . On the '8840, CRSEL1 selects 
the hardwired control functions when high. This function is performed by the CRSEL3 signal on the '8841 . 
Therefore, CRSEL2 and CRSEL1 are actually the added signals. 

The 'ACT8841 is a low-power CMOS device requiring only 5-V power. Because of its STL internal logic 
and TTL l/Os, the 'AS8840 requires both 2-V and 5-V power. 

Both the 'AS8840 and the 'ACT8841 are in 156 pin grid-array packages, however, the two devices are 
not pin-for-pin compatible. Control signals were added to the 'ACT8841 and the 2-V Vcc P |ns CAS8840 
only) were assigned other functions in the 'ACT8841. 

changing AS8840 microcode to 'ACT8841 microcode 

Since only six signals have been added to the 'ACT8841, changing existing 'AS8840 microcode to 
'ACT8841 microcode is straight forward. CRSEL3 on the 'ACT8841 is functionally equivalent to CRSEL1 
on the 'AS8840. CREAD2, CREAD1 , CRWRITE2, CRWRITE1 , CRSEL2, and CRSEL1 bits must be added. 
These can always be 0 if no additional control banks are needed. Additional control configurations can 
be stored by programming these bits. 

All other signals in the 'AS8840 microcode remain the same when converting to 'ACT8841 microcode. 
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Overview 



1 



SN74ACT8818 


16-Bit Microsequencer 


2 




SN74ACT8832 


32-Bit Registered ALU 


3 



SN74ACT8836 32- x 32-Bit Parallel Multiplier 




SN74ACT8837 


64-Bit 


Floating Point Processor 


5 




SN74ACT8841 


Digital 


Crossbar Switch 


6 



SN74ACT8847 64-Bit Floating Point/Integer Processor 






8 




Mechanical Data 


9 
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SN74ACT8847 
64-Bit Floating Point Unit 



• Superset of Tl's SN74ACT8837 

• 30-ns, 40-ns and 60-ns Pipelined Performance 

• Low-Power EPIC™ CMOS 

• Meets IEEE Standard for Single- and Double- 
Precision Formats 

• Performs Floating Point and Integer Add, 
Subtract, Multiply, Divide, Square Root, and 
Compare 

• 64-Bit IEEE Divide in 1 1 Cycles, 64-Bit Square 
Root in 14 Cycles 

• Performs Logical Operations and Logical Shifts 



The SN74ACT8847 is a high-speed, double-precision floating point and integer 
processor. It performs high-accuracy, scientific computations as part of a 
customized host processor or as a powerful stand-alone device. Its advanced 
math processing capabilities allow the chip to accelerate the performance of both 
CISC- and RISC- based systems. 

High-end computer systems, such as graphics workstations, mini-computers and 
32-bit personal computers, can utilize the single-chip 'ACT8847 for both floating 



point and integer functions. 




EPIC is a trademark of Texas Instruments Incorporated. 
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Introduction 



The SN74ACT8847 combines a multiplier and an arithmetic-logic unit in a single 
microprogrammable VLSI device. The 'ACT8847 is implemented in Texas instruments 
one-micron CMOS technology to offer high speed and low power consumption with 
exceptional flexibility and functional integration. The FPUs can be microprogrammed 
to operate in multiple modes to support a variety of floating point applications. 

The 'ACT8847 is fully compatible with the IEEE standard for binary floating point 
arithmetic, STD 754-1985. This FPU performs both single- and double-precision 
operations, integer operations, logical operations, and division and square root 
operations (as single microinstructions). 

Understanding the 'ACT8847 Floating Point Unit 

To support floating point processing in IEEE format, the 'ACT8847 may be configured 
for either single- or double-precision operation. Instruction inputs can be used to select 
three modes of operation, including independent ALU operations, independent multiplier 
operations, or simultaneous ALU and multiplier operations. 

Three levels of internal data registers are available. The device can be used in 
flowthrough mode (all registers disabled), pipelined mode (all registers enabled), or 
in other available register configurations. An instruction register, a 64-bit constant 
register, and a status register are also provided. 

Each FPU can handle three types of data input formats. The ALU accepts data operands 
in integer format or IEEE floating point format. A third type of operand, denormalized 
numbers, can also be processed after the ALU has converted them to "wrapped" 
numbers, which are explained in detail in a later section. The 'ACT8847 multiplier 
operates on normalized floating-point numbers, wrapped numbers, and integer 
operands. 

Microprogramming the 'ACT8847 

The 'ACT8847 is a fully microprogrammable device. Each FPU operation is specified 

by a microinstruction or sequence of microinstructions which set up the control inputs ^ 

of the FPU so that the desired operation is performed. 59 

00 

The microprogram which controls operation of the FPU is stored in the microprogram q 

memory (or control store). Execution of the microprogram is controlled by a < 

microsequencer such as the? Tl SN74ACT881 8 1 6-bit microsequencer. A discussion JjJ 

of microprogrammed architecture and the operation of the 'ACT8818 is presented 2 

in this Data Manual. CO 
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Support Tools 

Texas Instruments has developed functional evaluation models of the 'ACT8847 in 
software which permit designers to simulate operation of the FPU. To evaluate the 
functions of an FPU, a designer can create a microprogram with sample data inputs, 
and the simulator will emulate FPU operation to produce sample data output files, as 
well as several diagnostic displays to show specific aspects of device operation. Sample 
microprogram sequences are included in this section. 

Texas Instruments has also designed a family of low-cost real-time evaluation modules 
(EVM) to aid with initial hardware and microcode design. Each EVM is a small self- 
contained system which provides a convenient means to test and debug simple 
microcode, allowing software and hardware evaluation of components and their 
operation. 

At present, the 74AS^EVM-8 Bit-Slice Evaluation Module has been completed, and 
a 1 6-bit EVM is in an advanced stage of development. EVMs and support tools for 
devices in the VLSI family are planned for future development. 

Design Support 

Texas Instruments Regional Technology Centers, staffed with systems-oriented 
engineers, offer a training course to assist users of Tl LSI products and their application 
to digital processor systems. Specific attention is given to the understanding and 
generation of design techniques which implement efficient algorithms designed to 
match high-performance hardware capabilities with desired performance levels. 

Information on VLSI devices and product support can be obtained from the following 
Regional Technology Centers: 
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Atlanta 

Texas Instruments Incorporated 
3300 N.E. Expressway, Building 8 
Atlanta, GA 30341 
404/662-7945 

Boston 

Texas Instruments Incorporated 
950 Winter Street, Suite 2800 
Waltham, MA 02154 
617/895-9100 

Northern California 

Texas Instruments Incorporated 

5353 Betsy Ross Drive 

Santa Clara, CA 95054 

408/748-2220 



Chicago 

Texas Instruments Incorporated 

51 5 Algonquin 

Arlington Heights, IL 60005 

312/640-2909 

Dallas 

Texas Instruments Incorporated 
10001 E. Campbell Road 
Richardson, TX 75081 
214/680-5066 

Southern California 

Texas Instruments Incorporated 

1 7891 Cartwright Drive 

Irvine, CA 92714 

714/660-8140 
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ACT8847 Logic Symbol 



ACT8847 
64-Bit Floating Point Unit 
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• C REGISTER CLOCK 
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INSTRUCTION. RA, & RB 
REGISTERS 



ALU AND MULTIPLIER 
PIPELINE REGISTERS 



STATUS. P. AND S 
REGISTERS 



PARITY 

I/O 



STATUS 
PARITY 



FLOWTHROUGH 
EN 

FLOWTHROUGH 
EN 

I FLOWTHROUGH 
I EN 

DA DATA 
DB DATA 
Y BUS 

DA DATA 
DB DATA 
MASTER/SLAVE 
COMPARATOR 



INSTRUCTION 



LOAD RA REGISTER 
LOAD RB REGISTER 
EXCEPTION & OTHER STATUS 
COMPARISON STATUS 
Y31-Y0, PY3-PY0 



COMPARISON 
STATUS 



EXCEPTION 
AND 
OTHER 
STATUS 



ZL 



r 4 - 



:> 



< data > 



0 




« ► 


• 




• 


• 


• 


• 


31 




4-> 
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Design Expertise 



Texas Instruments can provide in-depth technical design assistance through 
consultations with contract design services. Contact your local Field Sales Engineer 
for current information or contact VLSI Systems Engineering at 214/997-3970. 

'ACT8847 Pin Descriptions 

Pin descriptions and grid allocation for the 'ACT8847 are given on the following pages. 



A 
B 
C 
D 
E 
F 
G 
H 



M 
N 
P 
R 
S 
T 



208 PIN . . . GA PACKAGE 
(TOP VIEW) 



3 4 



6 



8 



10 11 12 13 14 15 16 17 
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Table 1 . ' ACT8847 Pin Grid Allocation 





PIN 




PIN 




PIN 




PIN 




PIN 




PIN 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


NO. 


NAME 


A1 


NC 


C2 


YO 


E3 


FAST 


J1 5 


FLOWC 


P1 


ENRG 


S1 


NC 


A2 


INF 


C3 


Y3 


E4 


GND 


J16 


SRCC 


P2 


PI PESO 


S2 


PBO 


A3 


Y5 


C4 


Y6 


E14 


GND 


J17 


BYTEP 


P3 


RESET 


S3 


DBO 


A4 


Y8 


C5 


Y9 


E15 


AGTB 


K1 


SELOP3 


P4 


PB1 


S4 


DB4 


A5 


Y1 1 


C6 


Y12 


E16 


AEQB 


K2 


SEL0P4 


P5 


DB1 


S5 


DB1 1 


A6 


Y14 


C7 


Y15 


E1 7 


MSERR 


K3 


SELOP5 


P6 


DB5 


S6 


DB12 


A7 


Y17 


C8 


Y18 


F1 


I5 


K4 


GND 


P7 


DB9 


S7 


DB1 5 


A8 


Y20 


C9 


Y23 


F2 


13 


K14 


GND 


P8 


DB16 


S8 


DB19 


A9 


Y21 


C10 


Y26 


F3 


RNDO 


K15 


PA1 


P9 


DB21 


S9 


DB23 


A10 


Y24 


C1 1 


Y30 


F4 


GND 


K16 


PA 2 


P10 


DB28 


S10 


DB26 


A1 1 


Y27 


C12 


PY1 


F14 


GND 


K1 7 


PA 3 


P1 1 


DAO 


S1 1 


DB30 


A12 


Y29 


C13 


UNDER 


F15 


PERRA 


L1 


SELOP6 


P12 


DA4 


S12 


DA2 


A13 


PYO 


C14 


INEX 


F16 


OEY 


L2 


SELOP7 


P13 


DA8 


S13 


DA6 


A14 


PY3 


C15 


DENIN 


F17 


UES 


L3 


CLK 


P14 


DA12 


S14 


DA10 


A15 


IVAL 


C16 


SRCEX 


G1 


17 


L4 


Vrr 


P15 


DA19 


S15 


DA14 


A16 


NEG 


C17 


CHEX 


G2 


16 


L14 


GND 


P16 


DA22 


S16 


DA1 5 


A17 


NC 


D1 


11 


G3 


14 


L1 5 


DA30 


P17 


DA23 


S17 


DA1 7 


B1 


ED 


D2 


RND1 


G4 


Vrr 


L16 


DA31 


R1 


PIPES1 


T1 


NC 


B2 


Y2 


D3 


Y1 


G14 


Vrr 


L17 


PAO 


R2 


HALT 


T2 


PB3 


B3 


Y4 


D4 


GND 


G15 


OEC 


Ml 


ENRB 


R3 


PB2 


T3 


DB3 


B4 


Y7 


D5 


Vrr 


G16 


SELMS/LS 


M2 


ENRA 


R4 


DB2 


T4 


DB7 


B5 


Y10 


D6 


GND 


G17 


TEST1 


M3 


CLKC 


R5 


DB6 


T5 


DB8 


B6 


Y13 


D7 


GND 


H1 


110 


M4 


GND 


R6 


DB10 


T6 


DB13 


B7 


Y16 


D8 


Vrr 


H2 


19 


M14 


Vrr 
v LL 


R7 


DB14 


T7 


DB17 


B8 


Y19 


D9 


GND 


H3 


18 


M15 


DA27 


R8 


DB18 


T8 


DB20 


B9 


Y22 


D10 


GND 


H4 


GND 


M16 


DA28 


R9 


DB22 


T9 


DB24 


B10 


Y25 


D11 


v C c 


H14 


GND 


M17 


DA29 


R10 


DB27 


T10 


DB25 


B11 


Y28 


D12 


GND 


H15 


TESTO 


N1 


CONFIGO 


R11 


DB31 


T11 


DB29 


B12 


Y31 


D13 


GND 


H16 


SELST1 


N2 


CONFIG1 


R12 


DA3 


T12 


DA1 


B13 


PY2 


D14 


v C c 


H17 


SELSTO 


N3 


CLKMODE 


R13 


DA7 


T13 


DA5 


B14 


OVER 


D15 


STEX1 


J1 


SELOP2 


N4 


PIPES2 


R14 


DA11 


T14 


DA9 


B15 


RNDCO 


D16 


STEXO 


J2 


SELOP1 


N14 


DA18 


R15 


DA16 


TT5 


DA13 


B16 


DENORM 


D17 


UNORD 


J3 


SELOPO 


N15 


DA24 


R16 


DA20 


T16 


NC 


B17 


DIVBYO 


E1 


12 


J4 


v C c 


N16 


DA25 


R17 


DA21 


T17 


NC 


C1 


PERRB 


E2 


IO 


J14 


v C c 


N17 


DA26 











m 



oo 

00 

o 
< 

z 
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Table 2. 'ACT8847 Pin Functional Description 



PIN 
NAME 


NO. 


I/O 


DESCRIPTION 








Comparison status or zero detect pin. When high, 


AEQB 


E1 6 


I/O 

If w 


indicates that A and B operands are equal during a 
compare operation in the ALU. If not a compare, a 
high signal indicates a zero result on the Y bus. 


AGTB 


E1 5 


I/O 


Comparison status pin. When high, indicates that A 
operand is greater than B operand. 








When high, selects parity generation for each byte 


BYTEP 


J17 


I 


of input (four parity bits for each bus). When low, 




selects parity generation for whole 32-bit input 
(one parity bit for each bus). 








Status pin indicating an exception during a chained 


CHEX 


C1 7 


I/O 


function. If 16 is low, indicates the multiplier is the 
source of an exception. If 16 is high, indicates the 
ALU is the source of an exception. 


CLK 


L3 


I 


Master clock for all registers except C register 


CLKC 


M3 


I 


C register clock 








Selects whether temporary register loads only on 


CLKMODE 


N3 


I 


rising clock edge (CLKMODE = L) or on falling 
edge (CLKMODE = H). 


CONFIGO 


NT 


I 


Select data sources for RA and RB registers from 


CONFIG1 


N2 




DA bus, DB bus and temporary register 


DAO 


P1 1 






DA1 


T12 






DA2 


S12 






DA3 


R12 






DA4 


P12 






DA5 


T13 






DA6 


S13 






DA7 


R13 






DAS 


P13 






DA9 


T14 






DA 10 


S14 






DA1 1 


R14 




DA 32-bit input data bus. Data can be latched in a 


DA12 


P14 


I 


64-bit temporary register or loaded directly into an 


DAI 3 


T15 




input register 


DAI 4 


$15 






DA15 


S16 






DA16 


R15 






DA17 


S17 






DA18 


N14 






DA19 


P15 






DA20 


R16 






DA21 


R 1 7 






DA22 


P16 






DA23 


P17 






DA24 


N15 







Table 2. ' ACT8847 Pin Functional Description (Continued) 



PIN 
NAME 


NO. 


i/n 


tifqpription 


DA25 


N16 






DA26 


N17 






DA27 


M15 




DA 32-bit input data bus. Data can be latched in a 


DA28 


M16 


I 


64-bit temporary register or loaded directly into an 


DA29 


M17 




input register. 


DA30 


L15. 






DA31 


L16 






DBO 


S3 






DB1 


P5 






DB2 


R4 






DB3 


T3 






DB4 


S4 






DB5 


P6 






DB6 


R5 






DB7 


T4 






DB8 


T5 






DB9 


?7 






DB10 


R6 






DB1 1 


S5 






DB12 


S6 






OB 13 


T6 






DB14 
DB15 
DB16 
DB17 


R7 
S7 
P8 
T7 


I 


DB 32-bit input data bus. Data can be latched in a 




64-bit temporary register or loaded directly into an 
input register. 


DB18 


R8 






DB19 


S8 






DB20 


T8 






DB21 


P9 






DB22 


R9 






DB23 


S9 






DB24 


T9 






DB25 


T10 






DB26 


S10 






DB27 


R10 






DB28 


P10 






OB29 


T11 






DB30 


S11 






DB31 


R1 1 












Status pin indicating a denormal input to the 


DENIN 


C15 


I/O 


multiplier. When DENIN goes high, the STEX pins 
indicate which port had the denormal input. 








Status pin indicating a denormal output from the 


DENORM 


B16 


I/O 


ALU or a wrapped output from the multiplier. In 
FAST mode, causes the result to go to zero when 
DENORM is high. 



Table 2. 'ACT8847 Pin Functional Description (Continued) 



PIN 

NAME NO. 



I/O 



DESCRIPTION 



DIVBYO 



B17 



I/O 



Status pin indicating an attempted operation 
involved dividing by zero 



ED 



B1 



I/O 



Exception detect status signal representing logical 
OR of all enabled exceptions in the exception 
disable register 



ENRA 



M2 



When high, enables loading of RA register on a 
rising clock edge if the RA register is not disabled 
(see PIPESO below). 



ENRB 



M1 



When high, enables loading of RB register on a 
rising clock edge if the RB register is not disabled 
(see PIPESO below). 



ENRC 



P1 



When low, enables write to C register when CLKC 
goes high. 



FAST 



E3 



When low, selects gradual underflow (IEEE model). 
When high, selects sudden underflow, forcing all 
denormalized inputs and outputs to zero. 



FLOWC 



J15 



When high, causes product or sum to bypass 
C register, so that product or sum appears on the 
C register output bus. Timing is similar to P register 
or S register feedback operands. C register remains 
unchanged. Product or sum may also be 
simultaneously fed back in usual manner (not 
through C register). 



GND 


D4 


GND 


D6 


GND 


D7 


GND 


D9 


GND 


D10 


GND 


D12 


GND 


D13 


GND 


E4 


GND 


E14 


GND 


F4 


GND 


F14 


GND 


H4 


GND 


H14 


GND 


K4 


GND 


K14 


GND 


L14 


GND 


M4 


HALT 


R2 



Ground pins. NOTE: All ground pins should be 
used and connected. 



Stalls operation without altering contents of 
instruction or data registers. Active low. 
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Table 2. 'ACT8847 Pin Functional Description (Continued) 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


10 E2 

11 D1 

12 E1 

13 F2 

14 G3 

15 F1 

16 G2 

17 G1 
io no 
19 H2 
110 HI 


I 


Instruction inputs 


INEX C14 


I/O 


Status pin indicating an inexact output 


INF A2 


I/O 


When high, indicates output value is infinity. 


IVAL A1 5 


0 


Status pin indicating that an invalid operation or a 
nonnumber (NaN) has been input to the multiplier 

_ _ A 1 II 

or ALU, 


K /l C CD D C1 T 

MbtHK b I / 


U 


Master/Slave error output pin 


NC A1 
NC Al 7 

NC T1 
NC T16 
NC T17 




No internal connection. Pins should be left floating 


NEG A15 


I/O 


When high, indicates result has negative sign. 


OEC G15 


I 


Comparison status output enable/ Active low. 


OES F17 


I 


Exception status and other status output enable. 
Active low. 


OEY F16 


I 


Y bus output enable. Active low. 


OVER B14 


I/O 


Status pin indicating that the result is greater the 
largest allowable value for specified format 
(exponent overflow). 


PAO • L17 
PA1 K1 5 
PA2 K16 
PA3 K 1 7 


I 


Parity inputs for DA data 


PBO S2 
PB1 P4 
PB2 R3 
PB3 12 


I 


Parity inputs for DB data 


PERRA F15 


0 


DA data parity error output. When high, signals a 
byte or word has failed an even parity check. 


PERRB C1 


0 


DB data parity error output. When high, signals a 
byte or word has failed an even parity check. 



Table 2. 'ACT8847 Pin Functional Description (Continued) 



PIN 

NAME NO. 


I/O 


DESCRIPTION 


PIPESO P2 


I 


When low, enables instruction register and, 
depending on setting of ENRA and ENRB, the RA 
and RB input registers. When high, puts instruction, 
RA and RB registers in flowthrough mode. 


pipcci pi 


i 


When low, enables pipeline registers in ALU and 
multiplier. When high, puts pipeline registers in 
flowthrough mode. 


PIPES2 N4 


1 


When low, enables status register, product (P) and 
sum (S) registers. When high, puts status register, 
r ana o rcgioicra in Tiowxnrouyn moac. 


PYO A13 
dv 1 n o 

r Y I U I Z 

PY2 B13 
PY3 A14 


I/O 


Y port parity data 


RESET P3 


1 


Clears internal states, status, and exception disable 
register. Contents of internal pipeline registers are 
lost. Does not affect other data registers. Active 
low. 


RNDO F3 
RND1 D2 


1 


Rounding mode control pins. Select four IEEE 
rounding modes. 


RNDCO B15 


I/O 


When high, indicates the mantissa of a wrapped 
number has been increased in magnitude by 
rounding. 


SELOPO J3 
SELOPi J2 
SELOP2 J1 
SEL0P3 K1 
SELOP4 K2 

CCI HDC V Q 

OtLUrO l\o 

SEL0P6 L1 
SEL0P7 L2 


1 


Select operand sources for multiplier and ALU 


QFI QTH U17 

SELST1 H16 


1 


Select status source during chained operation 


SELMS/LS G16 


1 


vvnen low, seiecxs i_on 01 oh--uix resuii 10 ut? 
output on the Y bus. When high, selects MSH of 
64-bit result. (No effect on single-precision 
operations.) 


SRCC J16 


1 


When low, selects ALU as data source for C 
register. When high, selects multiplier as data 
source for C register. 


SRCEX C16 


I/O 


Status pin indicating source of exception, either 
ALU (SRCEX = L) or multiplier (SRCEX = H). 


STEXO D16 
STEX1 D15 


I/O 


Status pins indicating that a nonnumber (NaN) or 
denormal number has been input on A 
port (STEX1) or B port (STEXO). 



Table 2. ACT8847 Pin Functional Description (Continued) 



PIN 
NAME 


NO. 


I/O 


DESCRIPTION 


TESTO 
TEST1 


H1 5 
G1 7 


I 


Test pins 








Qtati ic r-\ i r\ inHipotinn l**i o + o root 1 1 + ic idDvart zmri 
Oldlub pill If lUIUdlli iy llldl d icbull lb IMcAdL-l dllU 






I/O 


lace than mir*irvM inn QllrwA/altlo \ / o 1 1 10 f nr f nrm at 
lebo IMdil nilllllMUIII dllUWdUlc VdlUt; IUI lUlllldl 

(exponent underflow). 








Comparison status pin indicating that the two 


UNOHD 


D1 7 


l/U 


inputs are unordered because at least one of them 
is a nonnumber (NaN). 


v cc 


D5 






vcc 


D8 






v cc 


D1 1 






V CC 


D1 4 






V CC 
V CC 


b4 




5-V power supply 


V CC 


I A 

J4 






V CC 


J1 4 






v cc 


L4 






V CC 


M 1 4 






YO 


C2 






Y1 


D3 






Y2 


B2 






Y3 


C3 






Y4 


B3 






Y5 


A3 






Y6 


C4 






Y7 


B4 






Y8 


A4 






Y9 


C5 






Y10 


B5 






Y11 


A5 






Y12 


C6 






Y13 


B6 






Y14 


A6 






Y15 


C7 


I/O 


32-bit Y output data bus 


Y16 


B7 






Y17 


A7 






Y18 


C8 






Y19 


B8 






Y20 


A8 






Y21 


A9 






Y22 


B9 






Y23 


C9 






Y24 


A10 






Y25 


B10 






Y26 


C10 







Table 2. 'ACT8847 Pin Functional Description (Concluded) 



PIN 

NAME 


NO. 


I/O 


DESCRIPTION 


Y27 


A11 






Y28 


B11 






Y29 


A12 


I/O 


32-bit Y output data bus 


Y30 


C1.1 






Y31 


B12 







ACT8847 Specifications 



absolute maximum ratings over operating free-air temperature range 
(unless otherwise noted) 1 " 

Supply voltage, Vcc • • • ^0.5 V to 6 V 

Input clamp current, l|K (V| < 0 or V| > Vqc) ±20 mA 

Output clamp current, IfjK (VO < 0 or Vo > Vcc) • • ■ • ±50 mA 

Continuous output current, lo (Vo = Vqc ±50 mA 

Continuous current through Vcc or GND pins ±100 mA 

Operating free-air temperature range . . . 0°C to 70 °C 

Storage temperature range . . -65°C to 150°C 

t Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage 
to the device. These are stress ratings only and functional operation of the device at these or 
any other conditions beyond those indicated under "recommended operating conditions" is 
not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect 
device reliability. 

recommended operating conditions 



PARAMETER 


SN74ACT8847 


UNIT 


MIN NOM MAX 


Vqc Supply voltage 


4.75 5.0 5.25 


V 


V|j-j High-level input voltage 




V 


V|'l Low-level input voltage 


o , Jtfrs 


V 


Iqh High-level output current 


- 8 


mA 


Iql Low-level output current 


,A h 8 


mA 


Vj Input voltage 


o v cc 


V 


Vq Output voltage 


o v C c 


V 


dt/dv Input transition rise or fall rate 


0 15 


ns/V 


Ta Operating free-air temperature 


0 70 


°C 



V) 
2 

> 
O 
H 
00 
00 
4* 
^1 
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electrical characteristics over recommended operating free-air 
temperature range (unless otherwise noted) 



PARAMETER 


TEST CONDITIONS 


v C c 


T. _ OCOp 


OIV / HMU 1 00*+ / 


UNIT 


MIIM TYP MAY 

IVIIIM 1 Ir IVIM/V 


MliM TYP MAX 

IVIIIM 1 Ir IVIM/V 


V 0 H 


'OH - 20 pA 


A R \/ 






V 


5.5 V 






Iqh - 8 mA 


4.5 V 




3.76 


5.5 V 




4.76 


vol 


l 0 L = 20 iiA 


4.5 V 






V 


5.5 V 




m *■ -. - 


Iql = 8 mA 


4.5 V 




0.45 


5.5 V 


: ' ' ' 


0.45 


»i 


V| = V C C or 0 


5.5 V 




±1 


(iA 


•cc 


V| » V C C or 0, lo 


5.5 V 




200 


IiA 


Ci 


Vj = V C C or 0 


5 V 






PF 



switching characteristics (see Note) 



PARAMETER 


SN74ACT8847-30 
MIN MAX 


UNIT 


tpdl 


Propagation delay from DA/DB/I input register 
to Y output 


72 


ns 


tpd2 


Propagation delay from input register to 
output buffer 


70 


ns 


^3 


Propagation delay from pipeline register to 
output buffer 


45 


ns 


l pd4 


Propagation from output register to 
output buffer 


18 


ns 


tpd5 


Propagation delay from SELMS/LS to Y output 


18 


ns 


td1 


Propagation delay time, input register to 
output register 


56 


ns 


td2 


Delay time, input register to pipeline register or 
pipeline register to output register 


30 


ns 


td3 


Delay time, CLKC after CLK to insure data 
captured in C register is data clocked into the 
sum or product register by that clock 


8 


ns 


td4 


Delay time, CLKC after CLK to insure data 
captured in C register is data clocked into the 
sum or product register by the previous clock 


2 


ns 



m 



Note: Switching data must be used with timing diagrams for different operating modes. 
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setup and hold times 



PARAMETER 


SN74ACT8847-30 


UNIT 


MIN MAX 


t su Setup time 


Instruction before CLKt 


10 


ns 


Data operand before CLKt 


10 


Data operand before second CLKI for 
double-precision operation (input register 
not enabled) 


40 wir*^ 


SRCC with respect to CLKC 


lic^ 1 r 

.„, 


t n Hold time 


Instruction input after CLKt 


0 


ns 


Valid Y bus output of the previous CLK 
cycle after rising clock edge 


5.5 


Valid status output of the previous CLK 
cycle after rising clock edge 


3 



clock requirements 



PARAMETER 


SN74ACT8847-30 


UNIT 


MIN MAX 


t w Pulse duration 


CLK high 


10 


ns 


CLK low 


10 


CLK low t 


10 


Clock period 




ns 



^Clock mode 1 cannot be used. 




CO 

z 

> 

O 
H 
00 
00 
£> 
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switching characteristics (see Note) 



PARAMETER 


SN74ACTRR47-40 

Wll 1 ~T#"%V* 1 00**T / "TV 

MIN MAX 


UNIT 


tpdl 


Propagation delay from DA/DB/I input register 

to V m itrw it 


95 


ns 


tpd2 


Propagation delay from input register to 

m itnnt Ki iff or 


90 


ns 


tpd3 


Propagation delay from pipeline register to 
output buffer 


60 


ns 


*pd4 


Prnnanatinn from mitmit ronictPr tr» 

output buffer 


JS^ 20 


ns 


*pd5 


Propagation delay from SELMS/LS to Y output 


20 

,/V 


ns 


td1 


Propagation delay time, input register to 
output register 


* 75 


ns 


td2 


Delay time, input register to pipeline register or 
pipeline register to output register 


^,|C"' ! : — 

% 40 


ns 


td3 


Delay time, CLKC after CLK to insure data 
captured in C register is data clocked into the 
sum or product register by that clock 


9 


ns 


kl4 


Delay time, CLKC after CLK to insure data 
captured in C register is data clocked into the 
sum or product register by the previous clock 


2 


ns 



Note: Switching data must be used with timing diagrams for different operating modes. 
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setup and hold times 



PARAMETER 


SIM74ACT8847-40 
MIN MAX 


UNIT 


t su Setup time 


Instruction before CLKt 


1 2 


ns 


Data operand before CLKt 


12 


Data operand before second CLKt for 
double-precision operation (input register 
not enabled) 


52 ■ l : o ; 


SRCC with respect to CLKC 


4.5 


t n Hold time 


Instruction input after CLKI 


0 


ns 


Valid Y bus output of the previous CLK 
cycle after rising clock edge 


5.5 


Valid status output of the previous CLK 
cycle after rising clock edge 


4 


clock requirements 


PARAMETER 


SN74ACT8847-40 
MIN MAX 


UNIT 


t w Pulse duration 


CLK high 


15 , 


ns 


CLK low 


15 ' ' 


CLK low 1 " 


10 


Clock period 




ns 



^Clock mode 1 cannot be used. 
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switching characteristics (see Note) 



PARAMETER 


SN74ACT8847-60 
MIN MAX 


UNIT 


tpdl 


Propagation delay from DA/DB/I input register 
to Y output 


125 


ns 


*pd2 


Propagation delay from input register to 
output buffer 


120 


ns 


*pd3 


Propagation delay from pipeline register to 
output buffer 


75 


ns 


*pd4 


Propagation from output register to 
output buffer 


28 


ns 


tpd5 


Propagation delay from SELMS/LS to Y output 


28 


ns 




Pronanation Hplav timp innut rpoi«?tpr tn 

■ * \J o y ci li w 1 1 v-iv/icjy niiivs, iiii^sui. iu^ioivi iu 

output register 


100 


ns 


td2 


Delay time, input register to pipeline register or 
pipeline register to output register 


60 


ns 


td3 


Delay time, CLKC after CLK to insure data 
captured in C register is data clocked into the 
sum or product register by that clock 


12 


ns 


*d4 


Delay time, CLKC after CLK to insure data 
captured in C register is data clocked into the 
sum or product register by the previous clock 


2 


ns 



Note: Switching data must be used with timing diagrams for different operating modes. 




00 
00 

o 
< 

2 
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setup and hold times 



PARAMETER 


SN74ACT8847-60 
MIN MAX 


UNIT 


t su Setup time 


Instruction before CLKt 


16 


ns 


Data operand before CLKt 


16 


Data operand before second CLKt for 
double-precision operation (input register 
not enabled) 


75 


SRCC with respect to CLKC 


6 


t n Hold time 


Instruction input after CLKI 


0 


ns 


Valid Y bus output of the previous CLK 
cycle after rising clock edge 


5.5 


Valid status output of the previous CLK 
cycle after rising clock edge 


4 


clock requirements 


PARAMETER 


SN74ACT8847-60 
MIN MAX 


UNIT 


t w Pulse duration 


CLK high 


20 


ns 


CLK low 




CLK low* 


n 


Clock period 




ns 



t Clock mode 1 cannot be used. 




C/) 



> 

o 

H 

00 
00 
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SN74ACT8847 FLOATING POINT UNIT 



The SN74ACT8847 is a high-speed floating point unit implemented in Tl's advanced 
1-/>im CMOS technology. The device is fully compatible with IEEE Standard 754-1985 
for addition, subtraction, multiplication, division, square root, and comparison. 

The 'ACT8847 FPU also performs integer arithmetic, logical operations, and logical 
shifts. Absolute value conversions, floating point to integer conversions, and integer 
to floating point conversions are available. The ALU and multiplier are both included 
in the same device and can be operated in parallel to perform sums of products and 
products of sums (see Figure 1). 

IEEE formatted denormal numbers are directly handled by the ALU. Denormal numbers 
must be wrapped by the ALU before being used in multiplication, division, or square 
root operations. A fast mode in which all denormals are forced to zero is provided 
for applications not requiring gradual underflow. 

The 'ACT8847 input buses can be configured to operate as two 32-bit data buses 
or as a single 64-bit bus, providing a number of system interface options. Registers 
are provided at the inputs, outputs, and inside the ALU and multiplier to support 
multilevel pipelining. These registers can be bypassed for nonpipelined operation. 

A clock mode control allows the temporary input register to be clocked on the rising 
edge or the falling edge of the clock to support double-precision ALU operations at 
the same rate as single-precision operations. A feedback register (C register) with a 
separate clock is provided for temporary internal storage of a multiplier result, ALU 
result or constant. 

To ensure data integrity, parity checking is performed on input data, and parity is 
generated for output data. A master/slave comparator supports fault-tolerant system 
design. Two test pin control inputs allow all l/Os and outputs to be forced high, low, 
or placed in a high-impedance state to facilitate system testing. 

Data Flow 

Data enters the 'ACT8847 through two 32-bit input data buses, DA and DB. The buses 
can be configured to operate as a single 64-bit data bus for double precision operations 
(see Table 3). Data can be latched in a 64-bit temporary register or loaded directly 
into the RA and RB registers for input to the multiplier and ALU. 

Four multiplexers select the multiplier and ALU operands from the input registers, 
C register or previous multiplier or ALU result. Results are output on the 32-bit Y bus; 
a Y output multiplexer selects the most significant or least significant half of the result 
if a double-precision number is being output. The 64-bit G register is provided for 
temporary storage of a result from the ALU or multiplier. 



7-33 



FROM 

INSTRUCTION 
REGISTER 




11010 

SELOP7 SELOPO 
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« vcc 

/ 17 4 GND 



Figure 1 . 'ACT8847 Floating Point Unit 



7-34 



Input Data Parity Check 

When BYTEP is high, internal odd parity is generated for each byte of input data at 
the DA and DB ports and compared to the PA and PB parity inputs. If an odd number 
of bits is set high in a data byte, the parity bit for that byte is also set high. Parity 
bits are input on PA for DA data and PB for DB data. PAO and PBO are the parity bits 
for the least significant bytes of DA and DB, respectively. If the parity comparison 
fails for any byte, a high appears on the parity error output pin (PERRA for DA data 
and PERRB for DB data). 

A parity check can also be performed on the entire input data word by setting BYTEP 
low. In this mode, PAO is the parity input for DA data and PBO is the parity input for 
DB data. 

Temporary Input Register 

A temporary input register is provided to enable loading of two double- precision 
numbers on two 32-bit input buses in one clock cycle. The contents of the DA bus 
are loaded into the upper 32 bits of the temporary register; the contents of DB are 
loaded into the lower 32 bits. 

A clock mode signal (CLKMODE) determines the clock edge on which the data will 
be stored in the temporary register. When CLKMODE is low, data is loaded on the 
rising edge of the clock. With CLKMODE set high, the temporary register loads on 
a falling edge and the RA and RB registers can then be loaded on the next rising edge. 

R A and RB Input Registers 

Two 64-bit registers, RA and RB, are provided to hold input data for the multiplier 
and ALU. Data is taken from the DA bus, DB bus and the temporary input register, 
according to configuration mode controls CONFIG 1-CONFIGO (see Tables 3 and 5). 
The registers are loaded on the rising edge of clock CLK. For single-precision operations, 
CONFIG 1-CONFIGO should ordinarily be set to 0 1 (see Table 4). 




I s * 

00 
00 

O 
< 

2 
C/5 
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Table 3. Double Precision Input Data Configuration Modes 





LOADING SEQUENCE 


DATA LOADED INTO TEMP 
REGISTER ON FIRST CLOCK 
AND RA/RB REGISTERS ON 
SECOND CLOCK "1" 


DATA LOADED INTO RA/RB 
REGISTERS ON SECOND 
CLOCK 


DA DB 


DA DB 


0 0 

0 1 

1 0 
1 1 


B operand B operand 

(MSH) (LSH) 
A operand B operand 

(LSH) (LSH) 
A operand B operand 

(MSH) (MSH) 
A operand A operand 

(MSH) (LSH) 


A operand A operand 

(MSH) (LSH) 
A operand B operand 

(MSH) (MSH) 
A operand B operand 

(LSH) (LSH) 
B operand B operand 

(MSH) (LSH)) 



^On the first active clock edge (see CLKMODE, Table 62), data in this column is loaded into the temporary 
register. On the next rising edge, operands in the temporary register and the DA/DB buses are loaded into 
the RA and RB registers. 



Table 4. Single Precision Input Data Configuration Mode 



CONFIG 1 CONFIGO 


DATA LOADED INTO 
RA/RB REGISTERS ON 
FIRST CLOCK 

DA DB 


NOTE 


0 1 


A operand B operand 


This mode is ordinarily used for single- 
precision operations. 



Table 5. Double Precision Input Data Register Sources 



m 



CONFIG 1 CONFIGO 


RA SOURCE 


RB SOURCE 


MSH LSH 


MSH LSH 


0 0 

0 1 

1 0 

1 1 


DA DB 

TEMP REG 

DA 

(MSH) 
TEMP REG DA 
(MSH) 

TEMP REG TEMP REG 
(MSH) (LSH) 


TEMP REG TEMP REG 
(MSH) (LSH) 
DB TEMP REG 
(LSH) 

TEMP REG nD 
Dd 

(LSH) 
DA DB 



00 
00 
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Multiplier/ALU Multiplexers 

Four multiplexers select the multiplier and ALU operands from the RA and RB registers, 
the previous multiplier or ALU result, or the C register. The multiplexers are controlled 
by input signals SELOP7-SELOP0 as shown in Tables 6 and 7. For division and square 
root operations, operands must be sourced from the input registers, RA and RB. 



Table 6. Multiplier Input Selection 



A1 (MUX1) INPUT 


B1 (MUX2) INPUT 


SELOP7 SELOP6 


OPERAND SOURCE* 


SELOP5 SELOP4 


OPERAND SOURCE* 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
ALU feedback 
RA input register 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
Multiplier feedback 
RB input register 



For division or square root operations, only RA and RB registers can be selected as sources. 



Table 7. ALU Input Selection 



A2 (MUX3) INPUT 


B2 (MUX4) INPUT 


SELOP3 SELOP2 


OPERAND SOURCE* 


SELOP1 SELOPO 


OPERAND SOURCE* 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
Multiplier feedback 
RA input register 


0 0 

0 1 

1 0 

1 1 


Reserved 
C register 
ALU feedback 
RB input register 



For division or square root operations, only RA and RB registers can be selected as sources. 



Pipelined ALU 

The pipelined ALU contains a circuit for floating point addition and/or subtraction of 
aligned operands, a pipeline register, an exponent adjuster and a normalizer/rounder. 
An exception circuit is provided to detect denormal inputs; these can be flushed to 
zero if the fast input is set high. If the FAST input is low, the ALU accepts a denormal 
as input. A denorm exception flag (DENORM) goes high when the ALU output is a 
denormal. ^ 

Integer processing in the ALU includes both arithmetic and logical operations on either qq 

two's complement numbers or unsigned integers. The ALU performs addition, 00 

subtraction, comparison, logical shifts, logical AND, logical OR, and logical XOR. q 

< 

The ALU may be operated independently or in parallel with the multiplier. Possible ALU ^ 

functions during independent operation are given in Tables 8 through 1 1 . Parallel ^ 

ALU/multiplier functions are listed in Table 16. jjj 
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Table 8. Independent ALU Operations, Single Floating-Point Operand 
(110 = 0,19 = 0, 17 = 0, 16 =0) 



CHAINED 
OPERATION 
110 



OPERAND 
FORMAT 
19 



PRECISION 
RA 
18 



PRECISON 
RB 
17 



OUTPUT 
SOURCE 
16 



OPERAND 
TYPE 
15 



ABSOLUTE 
VALUE A 
14 



ALU OPERATION 



I3-I0 



RESULT 



0 = Not 
Chained 



0 = 
Floating 
point 



0 = A(SP) 

1 = A(DP) 



0 = B(SP) 



0 = ALU 
result 



1 = Single 
Operand 



0 = A 
1 = |A| 



0000 
0001 
0010 



0011 



0100 



0101 
0110 

0111 



1000 

1010 

1100 
1101 
1110 



Pass A operand 
Pass -A operand 
2's complement integer 
to floating point 
conversion 1 " 
Floating point to 2's 
complement integer 
conversion 

Move A operand (pass 
without NaN detect or 
status flags active) 
Pass B operand 
Floating point to floating 
point conversion* 
Floating point to 
unsigned integer 
conversion 

Wrap (denormal) input 
operand 

Unsigned integer to 
floating point conversion 
Unwrap exact number 
Unwrap inexact number 
Unwrap rounded input 



^The precision of the integer to floating point conversion is set by 18. 

*This converts single-precision floating point to double-precision floating point and vice versa. If the 18 pin is low to indicate a single-precision input, the result 
of the conversion will be double precision. If the 18 pin is high, indicating a double-precision input, the result of the conversion will be single precision. 



Table 9. Independent ALU Operations, Two Floating-Point Operands 
(110 = 0, 19 - 0, 15 - 0) 



CHAINED 


OPERAND 


PRECISION 


PRECISION 


OUTPUT 


OPERAND 


ABSOLUTE 


ABSOLUTE 


ABSOLUTE 


ALU OPERATION 


OPERATION 


FORMAT 


RA 


RB 


SOURCE 


TYPE 


VALUE A 


VALUE B 


VALUE Y 






110 


19 


18 


17 


16 


15 


14 


13 


!2 


11-10 


RESULT 


0 - Not 
chained 


0 = 
Floating 
point 


0 = A(SP) 

1 = A(DP) 


0 - B(SP) 

1 = B(DP) 


0 = ALU 
result . 


0 = Two 
operands 


0 = A 
1 = |A| 


0 = B 

1 = |B| 


0 = Y 

1>.|Y| 


00 
01 
10 

11 


A + B 
A - B 

Compare A, B 
B - A 



Table 10. Independent ALU Operations, Single Integer Operand 
(110 = 0, 19 = 1, 16 « 0) 



CHAINED 
OPERATION 

no 


OPERAND FORMAT/PRECISION 


OUTPUT 
SOURCE 

16 


OPERAND 
TYPE 

15 


ALU OPERATION 


19 


18 


17 


14-10 


RESULT 


0 = Not 


1 = 


0 


0 - SP 2's 


0 - ALU 


1 = Single 


00000 


Pass A operand 


Chained 


Integer 




complement 


result 


Operands 


00001 


Pass - A operand 






0 


1 = SP 






00010 


Negate A operand ( 1 's complement) 














00101 


Pass B operand 








unsigned 






01000 


Shift A operand left logical * 








integer 






01001 


Shift A operand right logical ^ 














01101 


Shift A operand right arithmetic ^ 



B operand is number of bit positions A is to be shifted (See instruction description for "Independent ALU Operations".) 
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Table 1 1 . Independent ALU Operations, Two Integer Operands 
(110 « 0, 19 = 1/16 = 0) 



CHAINED 
OPERATION 
110 


OPERAND FORMAT/PRECISION 


OUTPUT 
SOURPF 
16 


OPERAND 
TYPE 
15 


ALU OPERATION 


19 


18 


17 


14-10 


RESULT 


0 = Not 


1- = 


0 


0 = SP 2's 


0 = ALU 


0 = Two 


00000 


A + B 


Chained 


Integer 




complement 


result 


Operands 


00001 


A - B 






0 


1 = SP 






00010 


Compare A, B 








unsigned 






00011 


B -A 








integer 






01000 


Logical AND (A, B) 














01001 


Logical AND (A, NOT B) 














01010 


Logical AND (NOT A, B) 














01100 


Logical OR (A, B) 














01101 


Logical XOR (A, B) 



Table 12. Independent Multiplier Operations 



CHAINED 
OPERATION 

no 


OPERAND FORMAT/PRECISION 

19 18 17 


OUTPUT 
SOURCE 
16 


MULTIPLY/ 
DIVIDE 
15 


ABSOLUTE 
VALUE A 
I4t 


ABSOLUTE 
VALUE B/ 
DIV/SQRT 
I3t 


NEGATE 
RESULT 
I2t 


WRAP A 

11 


WRAP B 
10 


0 = Not 
chained 


0 =. 0 = A(SP) 0 - B(SP) 
floating 1 ' = A(DP) 1 - B(DP) 
point 

1 = 0 0 = SP2's 
integer complement 

0 1 - SP 
unsigned 
integer 


1 ■ = 
Multi- 
plier 
result 


0 = 0 ' = A 0 = B 
multiply 1 = jA| 1 ' = |B| 

1 . =.. 0 - A 0 - Div 
Div/SQRT 1 = |A| 1 = SQRT 


0 = Y 

1.= -Y 


0 = Normal 

format 
1 = A is a 
wrapped 
number 


0 = Normal 

format 
1 = B is a 
wrapped 
number 



See also Tables 13 and 14. Operations involving absolute values or negated results are valid only when floating-point format is selected (19 = 0). 



Pipelined Multiplier 

The pipelined multiplier performs a basic multiply function/ division and square root. 
The operands can be single-precision or double-precision numbers and can be converted 
to absolute values before multiplication takes place. Independent multiplier operations 
are summarized in Tables 12 through 15. 

An exception circuit is provided to detect denormalized inputs; these are indicated 
by a high on the DENIN signal. Denormalized inputs must be wrapped by the ALU before 
multiplication, division, or square root. If results are wrapped (signaled by a high on 
the DENORM status pin), they must be unwrapped by the ALU. 

The multiplier and ALU can be operated simultaneously by setting the 110 instruction 
input high. Possible operations in this chained mode are listed in Table 16. Division 
and square root are performed as independent multiplier operations, even though both 
multiplier and ALU are active during divide and SQRT operations. 



Table 13. Independent Multiply Operations Selected by 14-12 (110 = 0, 16 = 1, 15 - 0) 



ABSOLUTE 
VALUE A 
14 


ABSOLUTE 
VALUE B 
13 


NEGATE 
RESULT 
12 


OPERATION SELECTED 


14-12 


RESULTS * 


0 = A 


0 - B 


0 ■■=■ Y 


000 


A * B 


1 = |A| 


1 = |B| 


1 = ~ Y 


001 


-(A'* B) 








010 


A * | B | ■ 








011 


-(A * |B|) 








100 


|Af * B 








101 


-(|A| *B). 








110 


|A| * |B| 








111 


-<iai; *.|BD 



Table 14. Independent Divide/Square Root Operations 
Selected by I4-I2 (110 0, 16 - 1, 15 - 1) 



ABSOLUTE 
VALUE A 
14 


DIVIDE/ 
SORT 
13 


NEGATE 
RESULT 
12 


OPERATION SELECTED 


14 12 


RESULTS* 


0 = A 


0 = Divide 


0 = Y 


000 


A / B 


1 = A 


1 = SQRT 


1 = - Y 


001 


-(A / B) 








010 


SQRT A 








011 


-(SQRT A) 








100 


|A|/B 








101 


-(.|A| IB) 








110 


SQRT |Aj 








111 


-(SQRT |A|) 



El 



Operations involving absolute values or negated results are valid only when floating point format is selected 
(19 =-0). 
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Table 15. Formats Selected by I8-I7 (I10 - 0 f 19 -0, 16 - 1) 



PRECISION 
SELECT RA 
18 


PRECISION 
RA INPUT 


PRECISION 
SELECT RB 
17 


PRECISION 
RB INPUT 


PRECISION 
OF RESULT 


0 


Single 


0 


Single 


Single 




Single 








0 


(Converted 


1 


Double 


Double 




to Double) 














Single 




1 


Double 


0 


(Converted 


Double 








to Double) 




1 


Double 


1 


Double 


Double 



Product, Sum, and C Registers 

The results of the ALU and multiplier operations may optionally be latched into two 
output registers on the rising edge of the system clock (CLK). The P (product) register 
holds the result of the multiplier operation; the S (sum) register holds the ALU result. 

An additional 64-bit register is provided for temporary storage of the result of an ALU 
or multiplier opration before feedback to the multiplier or ALU. The data source for 
this C register is selected by SRCC; a high on this pin selects the multiplier result; 
a low selects the ALU. A separate clock, CLKC, has been provided for this register. 

Parity Generators 

Odd parity is generated for the Y multiplexer output, either for each byte or for each 
word of output, depending on the setting of BYTEP. When BYTEP is high, the parity 
generator computes four parity bits/one for each byte of Y multiplexer output. Parity 
bits are output on the PY3-PY0 pins; PYO represents parity for the least significant 
byte. A single parity bit can also be generated for the entire output data word by setting 
„„ .,.,„ ^ .,- , : . BYTEP low. In this mode, PYO is the parity output. 

HHIQ Master/Slave Comparator 

A master/slave comparator is provided to compare data bytes from the Y. output 
multiplexer and the st atus outputs with data bytes on the external Y and status ports 

^ when OEY, OES and OEC are high. If the data bytes are not equal, a high signal is 

-P> generated on the master/slave error output pin (MSERR). 

> 

O Status and Exception Generation 

00 A status and exception generator produces several output signals to indicate invalid 
2* operations as well as overflow, underflow, nonnumerical and inexact results, in 
*"nI conformance with IEEE Standard 754-1985. If output registers are enabled 
(PIPES2 = 0), status and exception results are latched in a status register on the rising 
edge of the clock. Status results are valid at the same time that associated data results 
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Table 16. Chained Multiplier/ALU Operations (110 = 1) 

















NEGATE 


NEGATE 


ALU 
OPERATIONS 


CHAINED 


OPERAND FORMAT/PRECISION 


OUTPUT 


ADD 


MUI TIPL Y 

IVl UL. 1 IrL I 


ALU 


MUI TIPI IFR 

IVIVsL. 1 lr Lit ii 


OPERATION 








SOURCE 


ZERO 


BY ONE 


RESULT 


RESULT 


110 


19 


18 


17 


16 


15 


14 


I3t 


I2t 


11-10 


RESULT 


1 = 


0 = 


0 = A(SP) 


0 = B(SP) 


0 - 


0 = 


0 = 


0 = 


0 = 


00 


A + B 


Chained 


floating 


1 = A(DP) 


1 = B(DP) 


ALU 


Normal 


Normal 


Normal 


Normal 


01 


A - B 




point 






result 


operation 


operation 


operation 


operation 


10 


2 - A 












1 = 


1 = 


1 = 


1 = 


11 


B - A 




1 = 


0 


0 = SP 2's 


1 = 


Forces 


Forces 


Negate 


Negate 








integer 




complement 


Multi- 


B2 input 


B1 input 


ALU 


multiplier 










0 


1 = SP 
unsigned 
integer 


plier 
result 


of ALU 
to zero 


of multi- 
plier to 
one 


result 


result 







Operations involving negated results are valid only when floating-point format is selected (19 - 0). 
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are valid. Status outputs are enabled by two signals, OEC for comparison status and 
OES for other status and exception outputs/ Status outputs are summarized in 
Tables 17 and 18. 



Table 17. Comparison Status Outputs 



SIGNAL 


RESULT OF COMPARISON (ACTIVE HIGH) 


AEQB 

AGTB 
UNORD 


The A and B operands are equal. (A high signal on the AEQB output indicates a 
zero result from the selected source except during a compare operation in the ALU. 
During integer operations, Indicates zero status output.) 
The A operand is greater than the B operand. 

The two inputs of a comparison operation are unordered, i.e., one or both of the 
inputs is a NaN. 


Table 18. Status Outputs 


SIGNAL 


STATUS RESULT 


CHEX 

DENIN 

DENORM 

DIVBYO 
ED 

INEX 
INF 
IVAL 

NEG 
OVER 
RNDCO 

SRCEX 

STEXO 
STEX1 
UNDER 


If 16 is low, indicates the multiplier is the source of art exception during a chained 
function. If 16 is high, indicates the ALU is the source of an exception during a 
chained function. 

Input to the multiplier is a denorm. When DENIN goes high, the STEX pins indicate 
which port had the denormal input. 

The multiplier output is a wrapped number or the ALU output is a denorm. In the 

FAST mode, this condition causes the result to go to zero. 

An invalid operation involving a zero divisor has been detected by the multiplier. 

Exception detect status signal representing logical OR of all enabled exceptions 

in the exception disable register. 

The result of an operation is not exact. 

The output is the IEEE representation of infinity. 

A NaN has been input to the multiplier or the ALU, or an invalid operation 
(0 • oo or ± oo ± oo ) has been requested. This signal also goes high if an operation 
involves the square root of a negative number. When IVAL goes high, the STEX 
pins indicate which port had the NaN. 
Output value has negative sign 

The result is greater than the largest allowable value for the specified format. 
The mantissa of a wrapped number has been increased in magnitude by rounding 
and the unwrap round instruction must be used to properly unwrap the wrapped 
number (see Table 8). 

The status was generated by the multiplier. (When SRCEX is low* the status was 
generated by the ALU.) 

A NaN or a denorm has been input on the B port. 
A NaN or a denorm has been input on the A port. 

The result is inexact and less than the minimum allowable value for the specified 
format. In the FAST mode, this condition causes the result to go to zero. 
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An exception mask register is available to mask selected exceptions from the multiplier, 
ALU, or both. Multiply status is disabled during an independent ALU instruction, and 
ALU status is disabled during multiplier instructions. During chained operation both 
status outputs are enabled. 

When the exception mask register has been loaded with a mask, the mask is applied 
to the contents of the status register to disable unnecessary exceptions. Status results 
for enabled exceptions are then ORed together and, if true, the exception detect (ED) 
status output pin is set high. Individual status outputs remain active and can be read 
independently from mask register operations. 

During a compare operation in the ALU, the AEQB output goes high when the A and 
B operands are equal. When any operation other than a compare is performed, either 
by the ALU or the multiplier, the AEQB signal is used as a zero detect. 

In chained mode, results to be output are selected based on the state of the 16 (source 
output) pin (if 16 is low, ALU status will be selected; if 16 is high, multiplier status 
will be selected). If the nonselected output source generates an exception, CHEX is 
set high. Status of the nonselected output source can be forced using the SELST pins, 
as shown in Table 19. 



Table 19. Status Output Selection (Chained Mode) 



SELST 1- 




SELSTO 


STATUS SELECTED 


00 


Logical OR of ALU and multiplier exceptions (bit by bit) 


01 


Selects multiplier status 


10 


Selects ALU status 


11 


Normal operation (selection based on result source specified by 16 input) 



Flowthrough Mode 

To enable the device to operate in pipelined or flowthrough modes, registers can be 
bypassed using pipeline control signals PIPES2-PIPES0 (see Table 20). 



Table 20. Pipeline Controls (PIPES2-PIPES0) 



oo 
oo 
h- 
O 
< 

Z 
C/5 



PIPES2- 
PIPESO 



REGISTER OPERATION SELECTED 



Enables input registers (RA, RB) 

Disables input registers (RA, RB) 

Enables pipeline registers 

Disables pipeline registers 

Enables output registers (P, S, Status) 

Disables output registers (P, S, Status) 



X 0 

X 1 

0 X 

1 X 
X X 
X X 
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FAST and IEEE Modes 



The device can be programmed to operate in FAST mode by asserting the FAST pin. 
In the FAST mode, all denormalized inputs and outputs are forced to zero. 

Placing a zero on the FAST pin causes the chip to operate in IEEE mode. In this mode, 
the ALU can operate on denormalized inputs and return denormals. If a denorm is input 
to the multiplier, the DENIN flag will be asserted, and the result will be invalid. Denormal 
numbers must be wrapped before being input to the multiplier. If the multiplier result 
underflows, a wrapped number will be output. 

Rounding Modes 

The 'ACT8847 supports the four IEEE standard rounding modes: round to nearest, 
round towards zero (truncate), round towards infinity (round up), and round towards 
minus infinity (round down). The rounding function is selected by control pins RND1 
and RNDO, as shown in Table 21. 



C/> 

z 

> 

O 
H 
00 
00 

■p> 



Table 21. Rounding Modes 



RND1- 
RNDO 



0 0 

0 1 

1 0 

1 1 



ROUNDING MODE SELECTED 



Round towards nearest 

Round towards zero (truncate) 

Round towards infinity (round up) 

Round towards negative infinity (round down) 




Test Pins 

Two pins, TP1-TP0, support system testing. These may be used, for example, to place 
all outputs in a high-impedance state, isolating the chip from the rest of the system 
(see Table 22). 

Table 22. Test Pin Control Inputs 



TP1- 
TPO 



0 0 

0 1 

1 0 

1 1 



OPERATION 



All outputs and l/Os are forced low 

All outputs and l/Os are forced high 

All outputs are placed in a high impedance state 

Normal operation 



Summary of Control Inputs 

Control input signals for the 'ACT8847 are summarized in Table 23. 
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Table 23. Control Inputs 



SIGNAL 


HIGH 


LOW 


BYTEP 


Selects byte parity generation and test 


Selects single bit parity 






generation and test 




Clocks all registers (except C) on rising 


No effect 




edge 






Clocks C register on rising edge 


IMO cTTcCl 


ULIxlVIUUt 


Enables temporary input register load on 


Enables temporary input register 




falling clock edge 


load on rising clock edge 


CONFIG 1 - 


See Table 3 (RA and RB register data 


bee Table 4z (RA and RB 


CONFIGO 


source selects) 


register data source selects) 


fcNC d 


No effect 


Enables C register load when 






ulpvL. goes nign. 


ENRA 


If register is not in flow through, enables 


If register is not in flow through, 




clocking of RA register 


through, holds contents of RA 






register 


ENRB 


If register is not in flow through, enables 


If register is not in flow through, 




enables clocking of RB register 


holds contents of RB register 


r-ACT 

PAb 1 


Places device in FAST mode 


Places device in IEEE mode 


FLOW C 


Causes output value to bypass C 


IV 1 — n XX x 

No effect 




register and appear on C register output 






bus. 




HALT 


No effect 


Stalls device operation but 






does not affect registers, internal 






states, or status 


, 

OEC 


Disables compare pins 


Enables compare pins 


OES 


Disables status outputs 


Enables status outputs 


OEY 


Disables Y bus 


enables Y bus 


ninrc O 

PIPcSz- 


bee Table zu (Pipeline Mode Control) 


See Table 20 (Pipeline Mode 


DIDCCA 

rlrfcbU 




L/Ontrou 


RESET 


(V 1 _ „ XX _ _ j. 

No effect 


Clears internal states, status, 






internal pipeline registers, and 






exception disable register. Does 






not affect other data registers. 


RND1 - 


bee Table zl (Hounding Mode Control) 


bee I able zl (Hounding Mode 


RNDO 




Control) 


bfcLUr /- 


bee l ables b and / (Multiplier/ALU 


See Tables 6 and 7 


CETI ADA 

bcLOPU 


operand 


(Multiplier/ALU operand selection 




selection) 




SELMS/LS 


Selects MSH of 64-bit result for output 


Selects LSH of 64-bit result for 




output on the Y bus (no effect on single- 


output on the Y bus (no effect on 




precision operands) 


single-precision operands) 


SELST1- 


See Table 19 (Status Output Selection) 


See Table 19 (Status Output 


SELSTO 




Selection) 


SRCC 


Selects multiplier result for input to C 


Selects ALU result for input to C 




register 


register 


TP1-TP0 


See Table 22 (Test Pin Control Inputs) 


See Table 22 (Test Pin Control 






Inputs) 
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Instruction Set 

Configuration and operation of the 'ACT8847 can be selected to perform single or 
double-precision floating-point and integer calculations in operating modes ranging from 
flowthrough to fully pipelined. Timing and sequences of operations are affected by 
settings of clock mode, data and status registers, input data configurations, and 
rounding mode, as well as the instruction inputs controlling the ALU and the multiplier. 
The ALU and the multiplier of the 'ACT8847 can operate either independently or 
simultaneously, depending on the setting of instruction inputs 110-10 and related 
controls. 

Controls for data flow and status results are discussed separately, prior to the 
discussions of ALU and multiplier operations. Then, in Tables 27 through 35, the 
instruction inputs to the ALU and the multiplier are summarized according to operating 
mode, whether independent or chained (ALU and multiplier in simultaneous operation). 

Loading External Data Operands 

Patterns of data input to the ' ACT8847 vary depending on the precision of the operands 
and whether they are being input as A or B operands. Loading of external data operands 
is controlled by the settings of CLKMODE and CONFIG 1-CONFIGO, which determine 
the clock timing for loading and the registers that are used. 

Configuration Controls (CONFIG 1-CONFIGO) 

Three input registers are provided to handle input of data operands, either single 
precision or double precision. The RA, RB, and temporary registers are each 64 bits 
wide. The temporary register is (ordinarily) used only during input of double-precision 
operands. 

When single-precision or integer operands are loaded, the ordinary setting of CONFIG 1- 
CONFIGO is LH, as shown in Table 4. This setting loads each 32-bit operand in the 
most significant half (MSH) of its respective register. The operands are loaded into 
the MSHs and adjusted to double precision because the data paths internal to the device 
are all double precision. It is also possible to load single-precision operands with 
CONFIG 1-CONFIGO set to HH but two clock edges are required to load both the A 
and B operands on the DA bus. 

Double-precision operands are loaded by using the temporary register to store half 
of the operands prior to inputting the other half of the operands on the DA and DB 
buses. As shown in Tables 3 and 5, four configuration modes for selecting input sources 
are available for loading data operands into the RA and RB registers. 

CLKMODE Settings 

Timing of double-precision data inputs is determined by the clock mode setting, which 
allows the temporary register to be loaded on either the rising edge (CLKMODE = L) 
or the falling edge of the clock (CLKMODE = H). Since the temporary register is not 
used when single-precision operands are input, clock modes 0 and 1 are functionally 
equivalent for single-precision operations. 
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The setting of CLKMODE can be used to speed up the loading of double-precision 
operands. When the CLKMODE input is set high, data on the DA and DB buses are 
loaded on the falling edge of the clock into the MSH and LSH, respectively, of the 
temporary register. On the next rising edge, contents of the DA bus, DB bus, and 
temporary register are loaded into the RA and RB registers, and execution of the current 
instruction begins. The setting of CONFIG1 -CONFIGO determines the exact pattern 
in which operands are loaded, whether as MSH or LSH in RA or RB. 

Double-precision operation in clock mode 0 is similar except that the temporary register 
loads only on a rising edge. For this reason the RA and RB registers do not load until 
the next rising edge, when all operands are available and execution can begin. 

A considerable advantage in speed can be realized by performing double-precision ALU 
operations with CLKMODE set high. In this clock mode both double-precision operands 
can be loaded on successive clock edges, one falling and one rising and the ALU 
operation can be executed in the time from one rising edge of the clock to the next 
rising edge. Both halves of a double-precision ALU result must be read out on the Y 
bus within one clock cycle when the 'ACT8847 is operated in clock mode 1. 

Internal Register Operations 

Six data registers in the 'ACT8847 are arranged in three levels along the data paths 
through the multiplier and the ALU. Each level of registers can be enabled or disabled 
independently of the other two levels by setting the appropriate PIPES2-PIPES0 inputs. 

The RA and RB registers receive data inputs from the temporary register and the DA 
and DB buses. Data operands are then multiplexed into the multiplier, ALU, or both. 
To support simultaneous pipelined operations, the data paths through the multiplier 
and the ALU are both provided with pipeline registers and output registers. The control 
settings for the pipeline and output registers (PIPES2-PIPES1 ) are registered with the 
instruction inputs 110-10. 

A seventh register, the constant (CI register is available for storing a 64-bit constant 
or an intermediate result from the multiplier or the ALU. The C register has a separate 
clock input (CLKC), input source select (SRCC) and write enable ENRC (active low). 
The SRCC input is not registered with the instruction inputs. Depending on the operation 
selected and the settings of PIPES2-PIPES0, an offset of one or more cycles may be 
necessary to load the desired result into the C register. When the flowthrough control 
FLOWC is high, the output value bypasses the C register without affecting C register 
contents. Timing for FLOWC feedback is similar to P or S register feedback, which 
is not affected by FLOWC feedback. 

Status results are also registered whenever the output registers are enabled. Duration 
and availability of status results are affected by the same timing constraints that apply 
to data results on the Y output bus. 




7-49 



Data Register Controls (PIPES2-PIPES0) 

Table 20 shows the settings of the registers controlled by PIPES2-PIPES0. Operating 
modes range from fully pipelined (PIPES2-PIPES0 = 000) to flowthrough 
(PIPES2-PIPES0 = 111). 

In flowthrough mode all three levels of registers are disabled, a circumstance which 
may affect some double-precision operations. Since double-precision operands require 
two steps to input, at least half of the data must be clocked into the temporary register 
before the remaining data is placed oh the DA and DB buses. 

When all registers (except the C register) are enabled, timing constraints can become 
critical for many double-precision operations. In clock mode 1, the ALU can perform 
a double-precision operation and output a result during every clock cycle, and both 
halves of the result must be read out before the end of the next cycle. Status outputs 
are valid only for the period during which the Y output data is valid. 

Similarly, double-precision multiplication is affected by pipelining, clock mode, and 
sequence of operations. A double-precise multiply may require two cycles to execute 
and two cycles to output the result, depending on the settings of PIPES2-PIPES0. 

Duration of valid outputs at the Y multiplexer depends on settings of PIPES2-PIPES0 
and CLKMODE, as well as whether all operations and operands are of the same type. 
For example, when a double-precision multiply is followed by a single-precision 
operation, one open clock cycle must intervene between the dissimilar operations. 



Z 




C Register Controls (SRCC, CLKC, FLOWC, ENRC) 

The C register loads from the P or the S register output, depending on the setting of 
SRCC, the load source select. SRCC = H selects the multiplier as input source. 
Otherwise the ALU is selected when SRCC = L. In either case t he C re gister only loads 
the selected input on a rising edge of the CLKC signal when ENRC is low. 

The C register does not load directly from an external data bus. One method for loading 
a constant without wasting a cycle is to input the value as an A operand during an 
operation which uses only the ALU or multiplier and requires no external data inputs. 
Since the B operand can be forced to zero in the ALU or to one in the multiplier, the 
A operand can be passed to the C register either by adding zero or multiplying by one, 
then selecting the input source with SRCC and causing the CLKC signal to go high. 



•vi Otherwise, the C register can be loaded through the ALU with the Pass A Operand 

£ instruction, which requires a separate cycle. 

^ Separate controls are available to enable the C register (ENRC) or to bypass the C 

00 register when feeding an operand back on the C register output bus (FLOWC). 
00 
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Operand Selection (SELOP7-SELOP0) 

As shown in Tables 6 and 7, data operands can be selected five possible sources, 
including external inputs from the RA and RB registers, feedback from the P and S 
registers, and a stored value in the C register. Contents of the C register may be selected 
as either the A or the B operand in the ALU, the multiplier, or both. When an external 
input is selected, the RA input always becomes the A operand, and the RB input is 
the B operand. 

Feedback from the ALU can be selected as the A operand to the multiplier or as the 
B operand to the ALU. Similarly, multiplier feedback may be used as the A operand 
to the ALU or the B operand to the multiplier. During division or square root operations, 
operands may not be selected except from the RA and RB input registers 
(SELOPS7-SELOPS0 = 11111111). 

Selection of operands also interacts with the selected operations in the ALU or the 
multiplier. ALU operations with one operand are performed only on the A operand. 
Also, depending on the instruction selected, the B operand may optionally be forced 
to zero in the ALU or to one in the multiplier. 

Rounding Controls (RND1-RND0) 

Because floating point operations may involve both inherent and procedural errors, 
it is important to select appropriate modes for handling rounding errors. To support 
the IEEE standard for binary floating-point arithmetic, the 'ACT8847 provides four 
rounding modes selected by RND1-RND0. 

Table 21 shows the four selectable rounding modes. The usual default rounding mode 
is round to nearest {RND1-RND0 = LL). In round-to-nearest mode, the 'ACT8847 
supports the IEEE standard by rounding to even (LSB = 0} when two nearest 
representable values are equally near. Directed rounding toward zero, infinity, or minus 
infinity are also available. 

Rounding mode should be selected to minimize procedural errors which may otherwise 
accumulate and affect the accuracy of results. Rounding to nearest introduces a 
procedural error not exceeding half of the least significant bit for each rounding 
operation. Since rounding to nearest may involve rounding either upward or downward 
in successive steps, rounding errors tend to cancel each other. 

In contrast, directed rounding modes may introduce errors approaching one bit for 
each rounding operation. Since successive rounding operations in a procedure may 
all be similarly directed, each introducing up to a one-bit error, rounding errors may 
accumulate rapidly, especially in single-precision operations. 

Status Exceptions 

Status flags are provided to signal both floating point and integer exceptions. Integer 
status is provided using AEQB for zero (Z), NEG for sign, and OVER for 
overflow/carryout. 
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Status exceptions can result from one or more error conditions such as overflow, 
underflow, operands in illegal formats, invalid operations, or rounding. Exceptions may 
be grouped into two classes: input exceptions resulting from invalid operations or 
denormal inputs to the multiplier, and output exceptions resulting from illegal formats, 
rounding errors, or both. 

To simplify the discussion of exception handling, it is useful to summarize the data 
formats for representing IEEE floating-point numbers which can be input to or output 
from the FPU (see Table 24). Since procedures for handling exceptions vary according 
to the requirements of specific applications, this discussion focuses on the conditions 
which cause particular status exceptions to be signalled by the FPU. 

IEEE formats for floating-point operands, both single and double precision, consist of 
three fields: sign, exponent, and fraction, in that order. The leftmost (most significant) 
bit is the sign bit. The exponent field is eight bits long in single-precision operands 
and 1 1 bits long in double-precision operands. The fraction field is 23 bits in single 
precision and 52 bits in double precision. Further details of IEEE formats and exceptions 
are provided in the IEEE Standard for Binary Floating-Point Arithmetic, ANSI/IEEE 
Std 754-1985. 

Several status exceptions are generated by illegal data or instruction inputs to the FPU. 
Input exceptions may cause the following signals to be set high: IVAL, DIVBYO, DENIN, 
and STEX1 -STEXO. If the IVAL flag is set, either an invalid operation, such as the square 
root of - |x|, has been requested or a NaN (Not a Number) has been input. When 
DENIN is set, a denormalized number has been input to the multiplier. DIVBYO is set 
when the divisor is zero. STEX1 -STEXO indicate which port (RA, RB, or both) is the 
source of the exception when either a denormal is input to the multiplier (DENIN = H) 
or a NaN (IVAL = H) is input to the multiplier or the ALU. 

NaN inputs are all treated as IEEE signaling NaNs, causing the IVAL flag to be set. 
When output from the FPU, the fraction field from a NaN is set high (all 1 's), regardless 
of the original fraction field of the input NaN. 

Output exception signals are provided to indicate both the source and type of the 
H exception. DENORM, INEX, INF, NEG, OVER, UNDER, and RNDCO indicate the 
Efl exception type, and CHEX and SRCEX indicate the source of an exception. SRCEX 

indicates the source of a result as selected by instruction bit 16, and SRCEX is active 

whenever a result is output, not only when an exception is being signaled. The chained- 
^ mode exception signal CHEX indicates that an exception has be generated by the source 
4* not selected for output by 16. The exception type signaled by CHEX cannot be read 

unless status select controls SELST1-SELST0 are used to force status output from 
_4 the deselected source. 
00 

00 Output exceptions may be due either to a result in an illegal format or to a procedural 
^ error. Results too large or too small to be represented in the selected precision are 
signalled by OVER and UNDER. When INF is high, the output is the IEEE representation 
of infinity. Any ALU output which has been increased in magnitude by rounding causes 
INEX to be set high. DENORM is set when the multiplier output is wrapped or the ALU 
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Table 24. IEEE Floating Point Representations 



TYPE OF 


EXPONENT (e) 


FRACTION (f) 


HIDDEN 




VALUE OF NUMBER REPRESENTED 


nocDAivin 
vJrfcnMrou 


SP (HEX) 


DP (HEX) 


itSlnl An Y i 


BIT 
til 1 




SP (DECIMAL) * 


DP (DECIMAL) * 


Normalized 
Number (max) 


FE 


7FE 


All 1's 


1 


(-1)5 


(2^27) (2-2-23) 


(-1)5 (21023) (2-2-52) 


Normalized 
Number (min) 


01 


001 


All 0's 


1 


(-1) s 


(2-126) (i) 


(_1)S (2-1022) («j) 


Denormalized 
Number (max) 


00 


000 


All 1's 


0 


(1-) s 


(2-126) (1 -2-23) 


(-1)* (2-1022) (1 -2-52) 


Denormalized 
Number (min) 


00 


000 


000.. .001 


0 


(-1) s 


(2-126) (2-23) 


(-1)* (2-1022) (2- 52) 


Wrapped 
Number (max) 


00 


000 


AIM'S 


1 


<-D s 


(2-127) (2-2-23) 


(-1)8 (2-1023) (2-2-52) 


Wrapped 
Number (min) 


EA 


7CD 


AH 0's 


1 


{-D s 


(2-122 + 127)) (1) 


{ -1)S {2 -(51 4-1023)) (D 


Zero 


00 


000 


Zero 


0 


(-D s 


(0.0) 


(-1) s (0.0) 


Infinity 


FF 


7FF 


Zero 


1 


(-1) s 


(infinity) 


(- 1 ) s (infinity) 


NAN (Not a 
Number) 


FF 


7FF 


Nonzero 


N/A 


None 




None 



= sign bit. 
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output is denormalized. Wrapped outputs from the multiplier may be inexact or 
increased in magnitude by rounding, which may cause the INEX and RNDCO status 
signals to be set high. A denormal output from the ALU (DENORM = H) may also cause 
INEX to be set, in which case UNDER is also signalled. 

Handling of Denormalized Numbers (FAST) 

The FAST input selects the mode for handling denormalized inputs and outputs. When 
the FAST input is set low, the ALU accepts denormalized inputs but the multiplier 
generates an exception when a denormal is input. When FAST is set high, the DENIN 
status exception is disabled and all denormalized numbers, both inputs and results, 
are forced to zero. 

A denormalized input has the form of a floating-point number with a zero exponent, 
a nonzero mantissa, and a zero in the leftmost bit of the mantissa (hidden or implicit 
bit). A denormalized number results from decrementing the biased exponent field to 
zero before normalization is complete. Since a denormalized number cannot be input 
to the multiplier, it must first be converted to a wrapped number by the ALU. When 
the mantissa of the denormal is normalized by shifting it left, the exponent field 
decrements from all zeros (wraps past zero) to a negative two's complement number 
(except in the case of .1XXX..., where the exponent is not decremented. 

Exponent underflow is possible during multiplication of small operands even when the 
operands are not wrapped numbers. Setting FAST = L selects gradual underflow so 
that denormal inputs can be wrapped and wrapped results are not automatically 
discarded. When FAST is set high, denormal inputs and wrapped results are forced 
to zero immediately. 

When the multiplier is in IEEE mode and produces a wrapped number as its result, 
the result may be passed to the ALU and unwrapped. If the wrapped number can be 
unwrapped to an exact denormal, it can be output without causing the underflow status 
flag (UNDER) to be set. UNDER goes high when a result is an inexact denormal, and 
a zero is output from the FPU if the wrapped result is too small to represent as a 
denormal (smaller than the minimum denorm). Table 25 describes the handling of 
wrapped multiplier results and the status flags that are set when wrapped numbers 
are output from the multiplier. 



Table 25. Handling Wrapped Multiplier Outputs 



TYPE 
OF RESULT 


STATUS FLAGS SET 
DENORM INEX RNDCO UNDER 


NOTES 


Wrapped, 

exact 

Wrapped, 

inexact 

Wrapped, 

increased in 

magnitude 


1 0 0 0 
1 1 0 1 

1 1 1 1 


Unwrap with 'Wrapped 
exact' ALU instruction 
Unwrap with 'Wrapped 
inexact' ALU instruction 

Unwrap with 'Wrapped 
rounded' ALU instruction 
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When operating in chained mode, the multiplier may output a wrapped result to the 
ALU during the same clock cycle that the multiplier status is output. In such a case 
the ALU cannot unwrap the operand prior to using it, for example, when accumulating 
the results of previous multiplications. To avoid this situation, the FPU can be operated 
in FAST mode to simplify exception handling during chained operations. Otherwise, 
wrapped outputs from the multiplier may adversely affect the accuracy of the chained 
operation, because a wrapped number may appear to be a large normalized number 
instead of a very small denormalized number. 

Because of the latency associated with interpreting the FPU status outputs and 
determining how to process the wrapped output, it is necessary that a wrapped operand 
be stored external to the FPU (for example, in an external register file) and reloaded 
to the A port of the ALU for unwrapping and further processing. 

Exception Disable Mask Register 

The exception disable mask register can be loaded with a mask to enable or disable 
selected status exceptions. Status bits for enabled exceptions are logically ORed, and 
when the result is true, the ED pin goes high. During chained operations both multiplier 
and ALU results are ORed. During independent operation the nonselected status results 
are forced to zero. 

If the FPU is reset (RESET = 0), the exception disable mask register is cleared. Table 26 
describes the settings for the mask register load instruction and the status exceptions 
which can be enabled or disabled with the mask. 



Table 26. Loading the Exception Disable Mask Register 



INSTRUCTION 
INPUTS 


RESULTS 


110-17 =0111 


Exception mask load instruction 


16 


0 = Load ALU exception disable register 

1 = Load multiplier exception disable register 


I5t 


0 = IVAL exception enabled 

1 = IVAL exception disabled 


14 


0 = OVER exception enabled 

1 = OVER exception disabled 


13 


0 = UNDER exception enabled 

1 = UNDER exception disabled 


12 


0 = INEX exception enabled 

1 = INEX exception disabled 


11 


0 = DIVBYO exception enabled 

1 = DIVBYO exception disabled* 


10 


0 = DENORM exception enabled 

1 = DENORM exception disabled 



m 



Disabling IVAL in multiplier exception mask register also disables DENIN exception 
Only significant when 16 = 1 
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Data Output Controls (SELMS/LS, OEY) 



Selection and duration of results from the Y output multiplexer may be affected by 
several factors, including the operation selected, precision of the operands, registers 
enabled, and the next operation to be performed. The data output controls are not 
registered with the data and instruction inputs. When the device is microprogrammed, 
the effects of pipelining and sequencing of operations should be taken into account. 

Two particular conditions need to be considered. Depending on which registers are 
enabled, an offset of one or more cycles must be allowed before a valid result is available 
at the Y output multiplexer. Also, certain sequences of operations may require both 
halves of a double-precision result to be read out within a single clock cycle. This is 
done by toggling the SELMS/LS signal in the middle of the clock period. 

When a single-precision result is output, the SELMS/LS signal has no effect. The 
SELMS/LS signal is set low only to read out the LSH of a double-precision result. 
Whenever this signal is selecting a valid result for output on the Y bus, the OEY enable 
must be pulled low at the beginning of that clock cycle. 

Status Output Controls (SELST1 -SELST0,OES, OEC) 

Ordinarily, SELST1 -SELSTO are set high so that status selection defaults to the output 
source selected by instruction input 16. The ALU is selected as the output source when 
16 is low, and the multiplier when 16 is high. 

When the device operates in chained mode, it may be necessary to read the status 
results not associated with the output source. As shown in Table 1 9, SELST1 -SELSTO 
can be used to read the status of either the ALU or the multiplier regardless of the 
16 setting. 

Status results are registered only when the output (P and S) registers are enabled 
(PIPES2 = L). Otherwise, the status register is transparent. In either case, to read 
the status outputs/the output enables (OES, OEC, or both) must be pulled low. 

B Stalling the Device (HALT) 
Operation of the 'ACTS847 can be stalled nondestructively by means of the HALT 
signal/Pulling the HALT input low causes the device to stall on the next low level 
(/> of the clock. Register contents are unaltered when the device is stalled, and normal 
2 operation resumes at the next low clock period after the HALT signal is set high. Using 
^ HALT in microprograms can save power, especially using high clock frequencies and 
> pipelined stages. 
O 

— I ■ 

CO 

00 
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For some operations, such as a double-precision multiply with CLKMODE = 1 , setting 
the HALT input low may interrupt loading of the RA, RB, and instruction registers, 
as well as stalling operation. In clock mode 1 , the temporary register loads on the falling 
edge of the clock, but the HALT signal going low would prevent the RA, RB, and 
instruction registers from loading on the next rising clock edge. It is therefore necessary 
to have the instruction and data inputs on the pins when the HALT signal is set high 
again and normal operation resumes. 

Instruction Inputs (110-10) 

Three modes of operation can be selected with inputs 110-10, including independent 
ALU operation, independent multiplier operation, or simultaneous (chained) operation 
of ALU and multiplier. Each operating mode is treated separately in the following 
sections. 

In addition to the ALU and multiplier instructions described below, a NOP (no operation) 
instruction is provided, for example, to retain a double-precision result on the Y output 
bus for an additional cycle: 

NOP 110-10 = 01 1 0000 0000 

Independent ALU Operations 

The ALU executes single- and double-precision operations which can be divided 
according to the number of operands involved, one or two. Tables 27 and 29 show 
independent ALU operations with one operand/along with the inputs 110-10 which 
select each operation. Conversions from one format to another are handled in this mode, 
with the exception of adjustments to precision during two-operand ALU operations. 
Wrapping and unwrapping of operands is also done in this mode. 

Logical shifts can be performed on integer operands using the instructions shown in 
Table 68. The data operand to be shifted is input on the DA bus, and the number of 
bit positions the operand is to be shifted is input on the DB bus. The shift number 
on the DB bus should be in positive 32-bit integer format, although only the lowest 
eight bits are used. Neither the data operand nor the shift amount can be selected 
from sources other than the RA and RB registers, respectively. 

Tables 28 and 29 present independent ALU operations with two operands. When the 
operands are different in precision, one single and the other double, the settings of 
the precision-selects 18-17 will identify the single-precision operand so that it can GO 
automatically be reformatted to double precision before the selected operation is ^ 
executed, and the result of the operation will be double precision. q 

< 

(/) 
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Table 27. Independent ALU Operations with One Floating Point Operand 



Ai ii nppRATinw 

HLU UrCnM 1 IISIM 




NOTES 


UN A UrtKAIMU 


IIVIDI ITC MA IA 


Pass A operand 


OOx 00 1x 0000 




Pass -A operand 


OOx 001 x 0001 




Convert from 2's 


OOx 0010 0010 




complement integer 






to floating point ^ 






Convert from floating 


00x001x0011 




point to 2's complement 






integer 




A — L/Ull I udlc 


Move A operand (pass 


UUX UU IX Ul uu 


18 selects precision of A 


without NaN detect or 




nnpranH 


status flags active) 




0 = A (SP) 


Pass B operand 


OOx 001 x 0101 


1 = A (DP) 


Convert from floating 


OOx 001x 0110 


14 selects absolute value of 


point to floating point 


00x001x0111 


a operand: 


(adjusts precision of 




0 = A 


input: SP.-* DP, DP -* SP) 




1 = |A| 


Floating point to 




During integer to floating 


un^innpri intpnpr 




point conversion, |A | is not 


conversion 




allowed as a result. 


Wrap denormal operand 


00x001x 1000 




Unsigned integer to 


00x001x 1010 




floating point 






conversion 






Unwrap exact number 


OOx 001 x 1100 




Unwrap inexact number 


00x001x 1101 




Unwrap rounded input 


00x001x 1110 






During this operation, 18 selects the precision of the result. 



C/> 
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Table 28. Independent ALU Operations with Two Floating Point Operands 



ALU OPERATIONS 


INSTRUCTION 


AND OPERANDS 


INPUTS 110-10 


Add A + B 


OOx xOOO 0x00 


Add |A| + B 


OOx x001 0x00 


Add A + |B| 


OOx xOOO 1 xOO 


Add |A| + |B| 


OOx x001 1x00 


Subtract A - B 


OOx xOOO 0x01 


Subtract |A| - B 


OOx x001 0x01 


Subtract A - |B| 


OOx xOOO 1x01 


Subtract |A| - |B| 


OOx x001 1x01 


Compare A, B 


OOx xOOO 0x10 


Compare |A| , B 


OOx xO01 0x10 


Compare A, |B| 


OOx xOOO 1x10 


Compare |A|, |B| 


OOx x001 1x10 


Subtract B - A 


OOx xOOO 0x11 


Subtract B — | A | 


OOx x001 0x11 


Subtract | B | — A 


OOx xOOO 1x11 


Subtract | B | — 


OOx x001 1x11 



NOTES 



x = Don't Care 

18 selects precision of A 

operand: 

0 = A (SP) 

1 = A (DP) 

17 selects precision of B 
operand: 

0 = B (SP) 

1 = B (DP) 

12 selects either Y or its 
absolute value: 

0 = Y 

1 = |Y| 



Table 29. Independent ALU Operations with One Integer Operand 



ALU OPERATION 
ON A OPERAND 


INSTRUCTION 
INPUTS 110-10 


NOTES 


Pass A operand 

Pass -A operand (2's complement) 
Negate A operand (1's complement) 
Pass B operand 
Shift left logical 
Shift right logical 
Shift right arithmetic 


010 x010 0000 
010x0100001 
010 x010 0010 
010 x010 0101 
010 x010 1000 
010 x010 1001 
010 x010 1101 


x = Don't Care 

18 selects format of A integer 

operand: 

0 = Single-precision 2's 
complement 

1 = Single-precision unsigned 
integer 



m 
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Table 30. Independent ALU Operations with Two Integer Operands 



ALU OPERATIONS 


INSTRUCTION 


NOTES 


AND OPERANDS 


INPUTS 110-10 


Add A + B 


010 xOOO 0000 




Subtract A - B 


010 xOOO 0001 


x - Don't Care 


Compare A, B 


010 xOOO 0010 


1 7 selects format of A and B 


Subtract B - A 


010 xOOO 0011 


operands: 


Logical AND A, B 


010 xOOO 1000 


0 = Single-precision 2's 


Logical AND A, NOT B 


010 xOOO 1001 


complement 


Logical AND NOT A, B 


010 xOOO 1010 


1 = Single-precision unsigned 


Logical OR A, B 


010 xOOO 1100 


integer 


Logical XOR A, B 


010x000 1101 





Independent Multiplier Operations 

In this mode the multiplier operates on the RA and RB inputs which can be either single 
precision, double precision, or mixed. Separate instruction tables are provided for 
floating point operations and integer operations. 

Floating point operands may be normalized or wrapped numbers, as indicated by the 
settings for instruction inputs 11-10. As shown in Table 31, the multiplier can be set 
to operate on the absolute value of either or both floating point operands, and the 
result of any operation can be negated when it is output from the multiplier. Converting 
a single-precision denormal number to double precision does not normalize or wrap 
the denormal, so it is still an invalid input to the multiplier. 



Table 31 . Independent Floating Point Multiply Operations 



2 

> 

O 
H 
00 
00 




MULTIPLIER OPERATION 
AND OPERANDS 


INSTRUCTION 
INPUTS 110-10 


NOTES 


Multiply A * B 
Multiply -(A * B) 
Multiply A * |B| 
Multiply -(A * |B|) 
Multiply |A| * B 
Multiply -(|A| * B) 
Multiply |A| * |B| 
Multiply ~(|A| * |B|) 


OOx x100 OOxx 
OOx x 100 01 xx 
OOx x 1 00 1 0xx 
OOx x100 1 1xx 
OOx x101 OOxx 
OOx x101 01 xx 
OOx x101 10xx 
OOx x101 11 xx 


x = Don't Care 
18 selects A operand 
precision (0 = SP, 1 = DP) 
17 selects B operand 
precision (0 = SP, 1 = DP) 
11 selects A operand format 
(0 - Normal, 1 = Wrapped) 
10 selects B operand format 
(0 = Normal, 1 == Wrapped) 
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Table 32. Independent Floating-Point Divide/Square Root Operations 



MULTIPLIER OPERATION 
AND OPERANDS* 


INSTRUCTION 
INPUTS 110-10 


NOTES 


Divide A / B 
SQRT A 
Divide |A|/ B 
SQRT |A| 


OOx xi 10 Oxxx 
OOx x110 1xxx 
OOx x1 10 Oxxx 
00xx111 1xxx 


x - Don't Care 
18 selects A operand precision 
and 17 selects B operand 
precision (0 = SP, 1 = DP) 
12 negates multiplier result 
(0 = Normal, 1 = Negated) 
11 selects A operand format and 
10 selects B operand format 
(0 = Normal, 1 = Wrapped) 


^17 should be low or equal to 18 for square root operations 

Table 33. Independent Integer Multiply /Divide/Square Root Operations 


MULTIPLIER OPERATION 
AND OPERANDS* 


INSTRUCTION 
INPUTS 110-10 


NOTES 


Multiply A * B 
Divide A / B 
SQRT A 


010x100 0000 
010x110 0000 
010 x110 1000 


x = Don't care 

17 selects operand format: 

0 = SP 2's complement 

1 = SP unsigned integer 



Operations involving absolute values, wrapped operands, or negated results are valid only when floating- 
point format is selected (19 = 0). 



Chained Multiplier/ALU Operations 

In chained mode the 'ACT8847 performs simultaneous operations in the multiplier 
and the ALU. Operations include not only addition, subtraction, and multiplication, 
but also several optional operations which increase the flexibility of the device. Division 
and square root operations are not available in chained mode. 

The B operand to the ALU can be set to zero so that the ALU passes the A operand 
unaltered. The B operand to the multiplier can be forced to the value 1 so that the 
A operand to the multiplier is passed unaltered. 
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Table 34. Chained Multiplier/ALU Floating Point Operations 1 * 



CHAINED OPERATIONS 


OUTPUT 


INSTRUCTION 

IMDI ITC lin |A 

IIMrU I o liU-IU 


NOTES 


MULTIPLIER 


ALU 


A * B 


A + B 


A 1 1 1 
ALU 


lUX xuuu xxuu 




A * B 


A + B 


Multiplier 


10x x100 xxOO 




A * B 


A - B 


ALU 


10x xOOO xx01 




A * B 


A - B 


Multiplier 


10x x100 xx01 




A * B 


2 - A 


ALU 


10x xOOO xx10 


x = Don't Care 


A * B 


2 - A 


Multiplier 


10xx100xx10 


18 selects precision of 


A * B 


B - A 


ALU 


10x xOOO xx 1 1 


RA inputs: 


A * B 


B - A 


Multiplier 


10x x100 xx 1 1 


O — RA fQP\ 
\J — r\M (Or/ 


A * B 


A 4- 0 


ALU 


10x x010 xxOO 


1 — RA IDP) 


A * B 


A + 0 


Multiplier 


10x x1 10 xxOO 


17 selects precision of 


A * B 


0 - A 


ALU 


10xx010xx11 


RB inputs: 


A * B 


0 - A 


Multiplier 


10x x110 xx11 


0 = RB (SP) 

\J 1 1 ImJ \ \J 1 / 


A * 1 


A + B 


Al 1 1 


i Av vHn 1 wfin 
i \ja xuu i xxuu 


1 = RB (DP) 


A "* 1 


A + B 


iviuiiipiier 


I ux X I U I xxuu 


13 negates ALU result: 


A * 1 


A - B 


Al 1 I 
MLU 


I ux xuu I XXU I 


0 = Normal 


A * I 


A - B 


Multiplier 


I UX X I U I XXU I 


1 = Negated 


A * 1 


2 - A 


ALU 


1 v/A AvU 1 A A 1 \J 


12 negates multiplier 










result: 


A * 1 


2 - A 


Multiplier 


10x x101 xx10 


0 = Normal 


A * 1 


B - A 


ALU 


10x x001 xx1 1 










1 = Negated 


A * 1 


B - A 


Multiplier 


10x x101 xx11 


A * 1 


A + 0 


ALU 


lOx x011 xxOO 




A * 1 


A + 0 


Multiplier 


I0xx111 xxOO 




A * 1 


0 - A 


ALU 


10x x011 xx11 




A * 1 


0 - A 


Multiplier 


10x xl 11 xx11 





'The 110-10 setting 1xx xxlx xx10 is invalid, since it attempts to force the B operand of the ALU to both 
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Table 35. Chained Multiplier/ALU Integer Operations 



CHAINED OPERATIONS 


OUTPUT 
SOURCE 


INSTRUCTION 
INPUTS 110-10 


NOTES 


MULTIPLIER 


ALU 


A * B 


A ■ + B 


ALU 


110x000 0000 




A * B 


A + B 


Multiplier 


1 10 xTOO 0000 




A * B 


A - B 


ALU 


110 xOOO 0001 




A * B 


A - B 


Multiplier 


110x100 0001 




A . * B 


2 - A 


ALU 


110 xOOO 0010 




A * B 


2 - A 


Multiplier 


110 x100 0010 




A * B 


B - A 


ALU 


110x000 0011 




A * B 


B - A 


Multiplier 


110x100 0011 


y = Dnn't f^pirp 








A * B 


A + 0 


ALU 


110x010 0000 


17 selects format of A 










A * B 


A + 0 


Multiplier 


110x110 0000 


and B operands; 


A * B 


0 - A 


ALU 


110 x010 0011 


0 = SP 2's 


A * B 


0 - A 


Multiplier 


110 x1 10 001 1 


complement 


A * 1 


A + B 


ALU 


110 x001 0000 


1 = SP unsigned 


A * 1 


A + B 


Multiplier 


110x101 0000 


integer 


A * 1 


A - B 


ALU 


110 x001 0001 




A * 1 


A - B 


Multiplier 


110 x101 0001 




A * 1 


2 - A 


ALU 


110x001 0010 




A * T 


2 - A 


Multiplier 


110 x101 0010 




A * 1 


B - A 


ALU 


110 x001 0011 




A * 1 


B - A 


Multiplier 


110 x101 0011 




A * 1 


A + 0 


ALU 


110 x011 0000 




A * 1 


A + 0 


Multiplier 


110x111 0000 




A * 1 


0 - A 


ALU 


110 x011 0011 




A * 1 


0 - A 


Multiplier 


110 x1 1 1 xx11 






oo 
oo 
I- 
o 
< 

CO 



7 63 



MICROPROGRAMMING THE ACT8847 



Because the 'ACT8847 is microprogrammable, it can be configured to operate on either 
single- or double-precision data operands, and the operations of the registers, ALU, 
and multiplier can be programmed to support a variety of applications. The following 
examples present not only control settings but the timings of the specific operations 
required to execute the sample instructions. 

Timing of the sample operations varies with the precision of the data operands and 
the settings of CLKMODE and PIPES. Microinstructions and timing waveforms are given 
for all combinations of data precision, clock mode, and register settings. 

Division and square root operations are presented after the discussion of ALU and 
multiplier operations. Following the presentation of ALU and multiplier operations is 
a brief sum-of-products operation using instructions for chained operating mode. 

Single-Precision Operations 

Two single-precision operands can be loaded on the 32-bit input buses without use 
of the temporary register so CLKMODE has no effect on single-precision operation. 
Both the ALU and the multiplier execute all single-precision instructions in one clock 
cycle, assuming that the device is not operating in flowthrough mode (all registers 
disabled). Settings of the register controls PIPES2-PIPES0 determine minimum cycle 
time and the rate of data throughput, as evident from the examples below. 

Single-Precision ALU Operations 

Precision of each data operand is indicated by the setting of instruction input 18 for 
single-operand ALU instructions, or the settings of 18-17 for two-operand instructions. 
When the ALU receives mixed-precision operands (one operand in single precision and 
the other in double precision), the single-precision data input is converted to double 
and the operation is executed in double precision. 

If both operands are single precision, a single-precision result is output by the ALU. 
Operations on mixed-precision data inputs produce double-precision results. 

It is unnecessary to use the 'convert float-to-float' instruction to convert the single- 
precision operand prior to performing the desired operation on the mixed-precision 
operands. Setting 18 and 17 properly achieves the same effect without wasting an 
instruction cycle. 

Single-Precision Multiplier Operations 

Operand precision is selected by 18 and 17, as for ALU operations. The multiplier can 
multiply the A and B operands, either operand with the absolute value of the other, 
or the absolute values of both operands. The result can also be negated when it is 
output. If both operands are single precision, a single-precision result is output. 
Operations on mixed-precision data inputs produce double-precision results. 
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Sample Single-Precision Microinstructions 



The following four single-precision microinstruction coding examples show the four 
register settings, ranging from flowthrough to fully pipelined. Timing diagrams 
accompany the sample microinstructions. 

In the first example PIPES2-PIPES0 are all set high so the internal registers are all 
disabled. This microinstruction sets up a wrapped result from the multiplier to be 
unwrapped by the ALU as an exact denormalized number. In flowthrough mode the 
'unwrap exact' operation is performed without a clock as soon as the instruction is 
input. Single-precision timing in flowthrough mode is shown in Figure 2. 

CLKMODE = 0 PIPES = 111 Operation: Unwrap A operand exact 
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X 



INSTRUCTION: FUNCdO.O), RNDH.O). FAST 



IX 



FIRST OPERANDS 



X 



DATA{31,0) A AND B INPUTS 



SECOND INSTRUCTION 



X 



SECOND OPERANDS 



X 



FIRST 
RESULT 



W tpcH W 

OUTPUT(31,0), STATUS(18,0) 



SECOND 
RESULT 



Figure 2. Single-Precision Operation, All Registers Disabled 
(PIPES =111, CLKMODE - 0) 
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The second example shows a microinstruction causing the ALU to compare absolute 
values of A and B. Only the input registers are enabled (PIPES2 PIPESO = 1 10) so 
the result is output in one clock cycle. 

CLKMODE = 0 PIPES = 110 Operation: Compare |A|,|B| 
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Vd2- 



Figure 3. Single-Precision Operation, Input Registers Enabled 
(PIPES =110, CLKMODE = 0) 
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Input and output registers are enabled in the third example, which shows the subtraction 
B - A. Two clock cycles are required to load the operands, execute the subtraction, 
and output the result (see Figure 4). 
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Figure 4. Single-Precision Operation, Input and Output Registers Enabled 
(PIPES - 010, CLKMODE * 0) 
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The fourth example shows a multiplication A * B with all registers enabled. Three 
clock cycles are required to generate and output the product. Once the internal registers 
are all loaded with data or results, a result is available from the output register on every 
rising edge of the clock. The floating point unit produces its highest throughput when 
operated fully pipelined with single-precision operands. 

CLKMODE = 0 PIPES = 000 Operation: Multiply A * B 
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Figure 5. Single-Precision Operation, Ail Registers Enabled 
(PIPES - 000, CLKMODE - 0) 
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Double-Precision Operations 

Double-precision operations may be executed separately in the ALU or the multiplier, 
or simultaneously in both. Rates of execution and data throughput are affected by 
the settings of the register controls (PIPES2-PIPES0) and the clock mode (CLKMODE). 

The temporary register can be loaded on either the rising edge (CLKMODE = L) or 
the falling edge of the clock (CLKMODE = H). Double-precision operands are always 
loaded by using the 64-bit temporary register to store half of the operands prior to 
inputting the other half of the operands on the DA and DB buses. 

Input configuration is selected by CONFIG 1-CONFIGO, allowing several options for 
the sequence in which data operands are set up in the temporary register and the RA 
and RB registers. Operands are then sent to either the ALU or multiplier, or both, 
depending on the settings for SELOP 7-0. 

The ALU executes all double-precision operations in a single clock cycle. The multiplier 
requires two clock cycles to execute a double-precision operation. When the device 
operates in chained mode (simultaneous ALU and multiplier operations), the chained 
double-precision operation is executed in two clock cycles. The settings of 
PIPES2-PIPES0 determine whether the result is output without a clock (flowthrough) 
or after up to five clocks for a double-precision multiplication (all registers enabled 
and CLKMODE = L). 

Double-Precision ALU Operations 

Eight examples are provided to illustrate microinstructions and timing for double- 
precision ALU operations. The settings of CLKMODE and PIPES2-PIPES0 determine 
how the temporary register loads and which registers are enabled. Four examples are 
provided in each clock mode. 

Double-Precision ALU Operations with CLKMODE = 0 

The first example shows that, even in flowthrough mode, a clock signal is needed 
to load the temporary register with half the data operands (see Figure 6). The selected 
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operation is executed without a clock after the remaining half of the data operands 
are input on the RA and RB buses: 



CLKMODE = 0 PIPES = 111 Operation: Add A + |B| 
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Figure 6. Double-Precision ALU Operation, All Registers Disabled 
(PIPES -111, CLKMODE - 0) 
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In the second example the input register is enabled (PIPES2-PIPESO = 110). Operands 
A and B for the instruction, |B| - |A|, are loaded using CONFIG = 00 so that B is 
loaded first into the temporary register with MSH through the DA port and LSH through 
the DB port. On the second clock rising edge, the A operand is loaded in the same 
order directly to RA register while B is loaded from the temporary register to the RB 
register (see Figure 7). 

CLKMODE = 0 PIPES = 110 Operation: |B| - |A| 
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Figure 7. Double-Precision ALU Operation, Input Registers Enabled 
(PIPES * 110, CLKMODE - 0) 
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Both the input and output registers are enabled (PIPES2-PIPES0 = 010) in the third 
example. The instruction sets up the ALU to wrap a denormalized number on the DA 
input bus. The wrapped output can be fed back from the S register to the multiplier 
input multiplexer by a later microinstruction. Timing for this operation is shown in 
Figure 8. 

CLKMODE = 0 PIPES = 010 Operaion: Wrap Denormal Input 
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Figure 8. Double-Precision ALU Operation, Input and Output Registers Enabled 
(PIPES = 010, CLKMODE - 0) 
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In the fourth example with CLKMODE = L, all three levels of internal registers are 
enabled. The instruction converts a double-precision integer operand to a double- 
precision floating-point operand. Figure 9 shows the timing for this operating mode. 

CLKMODE = 0 PIPES = 000 Operation: Convert Integer to Floating Point 
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Figure 9. Double-Precision ALU Operation, All Registers Enabled 
(PIPES = 000, CLKMODE - 0) 
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Double-Precision ALU Operations with CLKMODE = 1 



The next four examples are similar to the first four except that CLKMODE = H so that 
the temporary register loads on the falling edge of the clock. When the ALU is operating 
independently, setting CLKMODE high enables loading of both double-precision 
operands on successive falling and rising clock edges. 

In this clock mode a double-precision ALU operation requires one clock cycle to load 
data inputs and execute, and both halves of the 64-bit result must be read out on 
the 32-bit Y bus within one clock cycle. The settings of PIPES2-PIPES0 determine 
the number of clock cycles which elapse between data input and result output. 

In the first example all registers are disabled (PIPES2-PIPES0 = 111), and the addition 
is performed in f lowthrough mode. As shown in Figure 1 0, a falling clock edge is needed 
to load half of the operands into the temporary register prior to loading the RA and 
RB registers on the next rising clock. 

CLKMODE = 1 PIPES = 111 Operation: Add A + |.B| 
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Figure 10. Double-Precision ALU Operation, Ail Registers Disabled 
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The second example executes subtraction of absolute values for both operands. Only 
the RA and RB registers are enabled (PIPES2-PIPES0 = 110). Timing is shown in 
Figure 1 1 . 
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Figure 11. Double-Precision ALU Operation, Input Registers Enabled 
(PIPES - 110, CLKMODE « 1) 
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The third example shows a single denormalized operand being wrapped so that it can 
be input to the multiplier. Both input and output registers are enabled 
(PIPES2-PIPES0 = 010). Timing is shown in Figure 12. 

CLKMODE = 1 PIPES = 010 Operation: Wrap Denormal Input 
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Figure 12. Double-Precision ALU Operation, Input and Output Registers Enabled 
(PIPES - 010, CLKMODE - 1) 



The fourth example shows a conversion from integer to floating point format. All three 
levels of data registers are enabled (PIPES2-PIPES0) so that the FPU is fully pipelined 
in this mode (see Figure 13). 

CLKMODE = 1 PIPES = 000 Operation: Convert Integer to Floating Point 
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Figure 13. Double-Precision ALU Operation, All Registers Enabled 
(PIPES - 000, CLKMODE - 1) 



Double-Precision Multiplier Operations 

Independent multiplier operations may also be performed in either clock mode and with 
various registers enabled. As before, examples for the two clock modes are treated 
separately. A double-precision multiply operation requires two clock cycles to execute 
(except in flowthrough mode) and from one to three other clock cycles to load the 
temporary register and to output the results, depending on the setting of 
PIPES2-PIPES0. 

Even in flowthrough mode (PIPES2-PIPES0 = 111) two clock edges are required, the 
first to load half of the operands in the temporary register and the second to load the 
intermediate product in the multiplier pipeline register. Depending on the setting of 
CLKMODE, loading the temporary register may be done on either a rising or a falling 
edge. 

Double-Precision Multiplication with CLKMODE = 0 

In this first example, the A operand is multiplied by the absolute value of B operand. 
Timing for the operation is shown in Figure 14: 

CLKMODE = 0 PIPES = 111 Operation: Multiply A *|B| 



S 
E 



0-0 

001 1100 1000 



ccc 






L 






L 0 0 P P 


S S 




M 




S S 


K N N I I 


E E 




S 




BEER 


M F F P P 


L L 


R R 


FEES/ 




YLLEH 


ON EE 


0 0 


N N 


A N N R 


OOOTSS SATT 


D G G S S 


P P 


D D 


S R R C L 


E E E 


E T T E L P P 


E 1-0 2-0 


7-0 


1-0 


TABCSYCS 


P 1-0 T T 1-0 


0 11111 


1111 xxxx 


00 


0 xx x x 


0 0 0 


X XX 1 1 11 




<t 
oo 
oo 

K 
O 
< 



7-85 



£t78813Vt7Z.NS 




Load Half of 
Operands 



CLK 



Load Pipeline 




DATA(31,0) A AND B INPUTS 



SELMS/LS 



OUT(31,0) STATUS(18,0) 



- t pd2- 



\ * t pd5 _+| 



Figure 14. Double-Precision Multiplier Operation, All Registers Disabled 
(PIPES -111, CLKMODE - 0) 



The second example assumes that the RA and RB input registers are enabled. With 
CLKMODE = 0 one clock cycle is required to input both the double-precision operands. 
The multiplier is set up to calculate the negative product of |A | and B operands: 

CLKMODE - 0 PIPES = 110 Operation: Multiply -~{|A| * B) 
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Figure 15. Double-Precision Multiplier Operation, Input Registers Enabled 
(PIPES - 110, CLKMODE - 0) 
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Enabling both input and output registers in the third example adds an additional delay 
of one clock cycle, as can be seen from Figure 16. The sample instruction sets up 
calculation of the product of | A| and |B|: 

CLKMODE = 0 PIPES = 010 Operation: Multiply |A| * |B| 
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Figure 16. Double-Precision Multiplier Operation, Input and Output Registers Enabled 
(PIPES * 010, CLKMODE - 0) 
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With all registers enabled, the fourth example shows a microinstruction to calculate 
the negated product of operands A and B: 



CLKMODE = 0 PIPES = 000 Operation: Multiply -(A * B) 
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Figure 17. Double-Precision Multiplier Operation, All Registers Enabled 
(PIPES - 000, CLKMODE = 0) 



Double-Precision Multiplication with CLKMODE = 1 



Setting the CLKMODE control high causes the temporary register to load on the falling 
edge of the clock. This permits loading both double-precision operands within the same 
clock cycle. The time available to output the result is also affected by the settings 
of CLKMODE and PIPES2-PIPES0, as shown in the individual timing waveforms. 

The first multiplication example with CLKMODE set high shows a multiplication in 
flowthrough mode (PIPES2-PIPES0 = 111). Figure 18 shows the timing for this 
operating mode: 

CLKMODE = 1 PIPES = 111 Operation: Multiply A * |B| 
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Figure 18. Double-Precision Multiplier Operation, All Registers Disabled 
(PIPES - 111, CLKMODE * 1) 

In the second example, the input registers are enabled and the instruction is otherwise 
similar to the corresponding example for CLKMODE = 0. Timing is shown in Figure 1 9. 
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Figure 19. Double-Precision Multiplier Operation, Input Registers Enabled 
(PIPES > 110, CLKMODE - 1) 
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With both input and output registers pipelined, the third example calculates the product 
of | A | and | B | . Enabling the output register introduces a one-cycle delay in outputting 
the result (see Figure 20): 

CLKMODE = 1 PIPES = 010 Operation: Multiply |A| * |B| 
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Figure 20. Double-Precision Multiplier Operation, Input and Output Registers Enabled 
(PIPES - 010, CLKMODE - 1) 
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The fourth example shows the instruction and timing (Figure 21) to generate the 
negated product of the A and B operands. This operating mode with CLKMODE set 
high and all registers enabled permits use of the shortest clock period and produces 
the most data throughput, assuming that this is the primary operating mode in which 
the device is to function. 

Additional considerations affecting timing and throughput are discussed in the section 
on mixed operations and operands. 

CLKMODE = 1 PIPES - 000 Operation: Multiply -(A * B) 
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Figure 21. Double-Precision Multiplier Operation, All Registers Enabled 
(PIPES - 000, CLKMODE =1) 
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Division and Square Root Operations 

The examples presented below contain sample microinstructions and timing diagrams 
for all modes of division and square root. The permissible settings of internal registers 
(PIPES2-PIPES0) are presented separately. To perform a valid divide or square root 
calculation, at least the RA and RB input registers must be enabled (PIPESO = 0). 

Sample timing waveforms are provided both for division and square root, along with 
sample microinstructions for permissible register control settings. Results for these 
operations are valid for one cycle and may be written over by results of the next 
instruction. This may happen when a double-precision operation is followed by a single- 
precision operation, notably when only the input registers are enabled 
(PIPES2-PIPES0 = 110). 

To retain a double-precision result on the Y output bus for an extra cycle, a NOP (no 
operation) may be coded using the following instruction: 

NOP 110-10 = 01 1 0000 0000 

Input enables ENRA and E NRB may also be set low for NOP this instruction to prevent 
loading of new data. The OEY signal is set low and SELMS/LS is set to get out the 
second half of the double-precision result. 

Division Microinstructions 

Division calculations are executed as independent multiplier operations selected by 
instruction bits 110, 1 5 and 13. Independent multiplier operation is selected when 110 
is set low. A divide operation requires that 15 is high and 13 is low. 

In all division operations the A operand is the dividend and the B operand is the divisor. 
Operands may be single-precision integers, single-precision floating point numbers, 
or double-precision floating point numbers. The operands must be either both integer 
or both floating point. 

Mixed-precision floating point operations are executed as double-precision operations, 
with the single-precision operand converted to double-precision format automatically. 
However, a single-precision wrapped input cannot be converted to double precision, 
so mixed-precision division involving a single-precision wrapped operand is not 
(/> permitted. 
2 

'^J Single-Precision Floating Point Division 

^ The following four sample microinstructions select division operations on singie- 
£J precision floating point operands. Each example includes a timing diagram for a different 
00 setting of the internal register controls {PIPES2-PIPES0). Operands and data inputs 
^ are presented at the same time in these instructions. 



The first example shows an instruction to perform | Aj / B with only the RA and RB 
input registers enabled (PIPES2-PIPES0 = 110). The output is available after the 
seventh rising clock edge and may remain stable for that cycle only (see Figure 22). 
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Figure 22. Single-Precision Floating Point Division 
(PIPES2-PIPES0 =110) 
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In the second example, a wrapped number is divided by a normalized number. The 
result of this operation may be an inexact zero, setting the DENORM, INEX, and UNDER 
status flags. The multiplier pipeline is enabled, allowing the next instruction to load 
one cycle earlier, on the seventh rising clock. 

CLKMODE = 0 PIPES = 100 Operation: A(wr) / B 
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Figure 23. Single-Precision Floating Point Division 
(PIPES2 PIPESO - 100) 
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Input and output registers are enabled (PIPES2 - PIPESO = 01 0) in the next example, 
which shows division of two wrapped numbers. The result, which will probably be 
a normalized number, is negated by setting 12 high. Enabling the output register adds 
a one cycle delay to the output. 

CLKMODE = 0 PIPES = 010 Operation: -A(wr) / B(wr) 
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Figure 24. Single-Precision Floating Point Division 
(PIPES2-PIPES0 = 010) 
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The fourth microinstruction is A/B with all registers enabled. The output is available 
after the eighth rising clock edge, and the next instruction may be loaded on the seventh 
rising clock. 

GLKMODE = 0 PIPES = 000 Operation: A/B 
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Figure 25. Single-Precision Floating Point Division 
(PIPES2-PIPES0 = 000) 

Double-Precision Floating Point Division 

The next four examples show double-precision floating point division. The operands 
may be single or double precision, or a combination of both. However, wrapped single 
precision numbers may not be used. A single-precision number should be loaded in 
the most significant half of the double precision operand. 

Each example includes a timing diagram for a different setting of the internal register 
controls (PIPES2-PIPES0). The examples use both clock modes 0 and 1. 
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The first example shows a mixed-precision, mixed-mode operation with input registers 
RA and RB enabled. A single precision number is divided by a double precision wrapped 
number. The output, in double-precision format is available after thirteen rising clock 
edges, and may be valid for that cycle only. If the next operation is double precision, 
the first half of the operands may be loaded on the rising edge of clock 1 3 (using clock 
mode 0). The second half operands and the instruction are loaded on the fourteenth 
clock. A single-precision operation may only be loaded on the fourteenth rising edge. 

CLKMODE = 0 PIPES = 110 Operation: A / B(wr) 
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Figure 26. Double-Precision Floating Point Division 
(PIPES2-PIPES0 = 110) 
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In the next example, the divisor is a wrapped number. Since both numbers are double 
precision, the result may overflow/ setting the OVR flag. With the multiplier pipeline 
and input registers enabled, the next double-precision operands and instruction may 
be loaded on the twelfth and thirteenth rising clock edges (clock mode 0), as shown 
in Figure 27. 

CLKMODE = 0 PIPES = 100 Operation: A / B(wr) 
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Figure 27. Double-Precision Floating Point Division 
(PIPES2-PIPES0 = 100) 
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A double-precision number is divided by a single-precision number in the third example. 
Clock mode 1 is used, with input and output registers enabled (PIPES2-PIPES0 = 010). 
The output is available after the fourteenth rising clock edge and may only be valid 
for that cycle. If the next operation is double precision, half the operands can be loaded 
on the falling edge of the thirteenth clock, and the second half operands and instruction 
load on rising edge of the fourteenth clock. If the next operation is single precision, 
the instruction must be loaded on the fourteenth rising edge. 

CLKMODE = 1 PIPES = 010 Operation: A/B 
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Figure 28. Double-Precision Floating Point Division 
(PIPES2-PIPES0 - 010) 
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The last example is a microinstruction for A / B with all registers enabled 
(PIPES2-PIPES0 = 000). Both operands are double precision. The output is available 
after the fourteenth rising edge. The next double-precision operation may be loaded 
on the falling edge of clock 12 and the rising edge of clock 13 using clock mode 1 . 
If the next operation is single precision, the instruction must be loaded on the thirteenth 
rising clock edge. 

CLKMODE = 1 PIPES = 000 Operation: A / B 
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Figure 29. Double-Precision Floating Point Division 
(PIPES2-PIPES0 ■ 000) 
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Integer Division 

The following sample microinstructions perform division operations on single-precision 
integer operands. Both operands must be in the same format (unsigned or two's 
complement), and absolute value, wrapped numbers, and negated outputs are not 
allowed. Each example includes a timing diagram for a different setting of the internal 
register controls (PIPES2-PIPES0). 

The first example shows A / B with only the input registers RA and RB enabled. Both 
operands are in two's complement format. The result is available after the rising edge 
of the fifteenth clock and may be valid for only that cycle. The next instruction and 
data may be loaded on the sixteenth clock. 



CLKMODE = 0 PIPES = 110 Operation; A/B 
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Enabling the multiplier pipeline register in the second example allows the next instruction 



to be loaded one cycle earlier. The quotient is still not available until after the fifteenth 


rising clock (see Figure 31). 
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Figure 31 . Integer Division 
(PIPES2-PIPES0 - 100) 
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In the third example, unsigned integer numbers are divided. The pipeline settings 
(PIPES2-PIPES0 = 010) enable the input and output registers. Timing for this 
microinstruction is shown in Figure 32. 

CLKMODE = 0 PIPES = 010 Operation: A /B 
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Figure 32. Integer Division 
(PIPES2-PIPES0 * 010) 
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The final example has all internal registers enabled. The output is valid after the 
sixteenth clock edge, but the next instruction can be loaded one cycle earlier. 

CLKMODE = 0 PIPES = 000 Operation: A / B 
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Figure 33. Integer Division 
(PIPES2-PIPES0 = 000) 



Square Root Microinstructions 

Square root calculations are performed as independent multiplier operations, as 
determined by the settings of instruction bits 110,15, and 13. Independent multiplier 
operation is selected by setting 110 low, while both 1 5 and 13 must be set high to select 
a square root operation. 

Taking the square root of a nonzero negative number sets the IVAL flag and produces 
an NaN result. Square root of negative zero does not set IVAL and produces a negative 
zero result. Only the A operand is used in a square root calculation. The operand may 
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be a single-precision integer, single-precision floating point number, or double-precision 
floating point number. Since the B operand is unused, the 17 input should be set low 
or set to match the 18 input, and the 10 input is a "Don't Care" 

Single-Precision Floating Point Square Root 

The following four sample microinstructions select square root operations on single- 
precision floating point operands. Each example includes a timing diagram for a different 
setting of the internal register controls (P1PES2-PIPES0). 

The first example shows an instruction to perform SQRT | A| on a wrapped number 
with only the RA and RB input registers enabled (PIPES2-PIPES0 = 110). The result 
is available after the tenth rising clock edge and may be valid only for that cycle. The 
next instructions and data operand may be loaded on the following rising clock. 

GLKMODE = 0 PIPES = 110 Operation: SQRT |A| 
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Figure 34. Single-Precision Floating Point Square Root 
(PIPES2-PIPES0 =110) 
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The square root of a wrapped number is calculated in the second example. Both the 
input registers and multiplier pipeline are enabled, allowing the next instruction to be 
loaded on the tenth rising clock (see Figure 35). 

CLKMODE = 0 PIPES = 100 Operation: SQRT A(wr) 
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Figure 35. Single-Precision Floating Point Square Root 
(PIPES2-PIPES0 - 100) 
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Enabling both the input and output registers (PIPES2-PIPES0 = 010) in the next 
example delays the output by one cycle. This instruction calculates -SQRT |A| 

CLKMODE = 0 PIPES = 010 Operation: - SQRT |A| 
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Figure 36. Single-Precision Floating Point Square Root 
(PIPES2-PIPES0 = 010) 
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The fourth instruction performs SORT A with all registers enabled. The output is valid 
after the eleventh rising clock edge, and the next instruction may be loaded on the 
tenth rising clock. 
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Figure 37. Single-Precision Floating Point Square Root 
(PIPES2-PIPES0 - 000) 
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Double-Precision Floating Point Square Root 

The next four examples show square root operations on double-precision floating point 
operands. Each example includes a timing diagram for a different setting of the internal 
register controls (PIPES2-PIPES0). Both clock modes 0 and 1 are used in the examples. 

The first sample instruction calculates - SQRT A. Clock mode 1 is shown. With only 
the input registers RA and RB enabled, the output is valid after the sixteenth rising 
clock edge and may be stable for that cycle only. If the next instruction is a double- 
precision operation, the first half of the data operands may be loaded on the falling 
edge of clock 16 (assuming clock mode 1). The second half of the data and the 
instruction are loaded on the seventeenth rising clock edge. A single-precision operation 
must be loaded on the seventeenth rising edge. 

CLKMODE = 1 PIPES = 110 Operation: - SQRT A 
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Figure 38. Double-Precision Floating Point Square Root 
(PIPES2-PIPES0 = 110) 
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The second example shows an instruction to perform SQRT | A| . Because the multiplier 
pipeline is enabled (PIPES2-PIPES0 = 100')., the next double-precision operand and 
instruction may be loaded on the falling edge of clock 1 5 (using clock mode 1) and 
the rising edge of clock 16. If the next operation is single precision, it must be loaded 
on the sixteenth rising clock. 

CLKMODE = 0 PIPES = 100 Operation: SQRT |A| 
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Figure 39. Double-Precision Floating Point Square Root 
(PIPES2-PIPES0 - 100) 
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In the third example, the operand is a wrapped number. With both input and output 
registers enabled (PIPES2-PIPES0 = 010), the output is valid after the seventeenth 
clock cycle. If the next operation is double precision, the first. half operands may be 
loaded on the sixteenth rising clock edge, using clock mode 0. The remaining operands 
and instruction are then loaded on the seventeenth rising edge. If the next operation 
is single precision, the instruction and operands are loaded on the rising edge of 
clock 17. 

CLKMODE = 1 PIPES = 010 Operation: SQRT A(wr) 



1 I 

0-0 



S 
E 

CCC L 

L 0 0 P P S S M S S _ 

KNN I I EE S BEE R_ 

M F F P P LL RR FEES/ YLLEH 

0 II EE 0 0 NN ANNS _ O OOTS'S SATT 
DGGSS PP DD SRRCLEEEETT ELPP 
E 1-0 2-0 7-0 1-0 T A B C S Y C S P 1-0 T T 1-0 



001 0110 101x 1 10 010 1111 1111 00 0 1 1 x x 0 0 0 x 11 1 1 11 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 



CLK 



INST 



I I 
I I 



in 

i 

i 

/ SQUARE \ 

!\ " 0OT A 



I v_ f I 

j-*su1 



fllOJ 



UNDETERMINED 



UNDETERMINED 



J NEXT (DP) V" 

^ul i I 



Figure 40. Double-Precision Floating Point Square Root 
(PIPES2-P1PES0 = 010) 
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All registers are enabled in the fourth example, which calculates SQRT A. Using clock 
mode 0, the next double precision operation may be loaded on the fifteenth and 
sixteenth rising clock edges. A single precision operation would load on the sixteenth 
clock. The output is available after the sixteenth rising clock edge and may only be 
valid for that cycle (see Figure 41). 

CLKMODE = 0 PIPES = 000 Operation: SQRT A 
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Figure 41 . Double-Precision Floating Point Square Root 
(PIPES2-PIPES0 = 000) 
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Integer Square Root 



The following example microinstructions perform square root operations on single 
precision integer operands. Absolute values and negated results are not valid options 
for integer calculations. Each example includes a timing diagram for a different setting 
of the internal register controls (PIPES2-PIPES0). 

The first instruction performs the square root of an unsigned integer with only the 
RA and RB input registers enabled. The output is available after the nineteenth rising 
clock edge and may only be valid for that cycle. The next instruction may be loaded 
on the twentieth rising edge. 

CLKMODE = 0 PIPES = 110 Operation: SORT A 
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Figure 42. Integer Square Root 
(PIPES2-PIPES0 = 110) 



oo 
oo 
h- 
o 
< 



7-1 1 9 



Enabling the multiplier pipeline register (PIPES2-PIPES0 = 100) in the second example 
allows the next instruction to be loaded one cycle earlier, on the nineteenth clock. 
The output will be valid after the nineteenth rising clock edge. (See Figure 43). 

CLKMODE = 0 PIPES = 100 Operation: SQRT A 
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Figure 43. Integer Square Root 
(PIPES2-PIPES0 - 100) 
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The third example instruction performs SQRT A on a two's complement integer 
operand. With both input and output registers enabled (PIPE2-PIPE0 = 010), the output 
is valid after the twentieth rising clock edge. The next instruction may be loaded on 
the twentieth clock. 

CLKMODE = 0 PIPES = 010 Operation: SQRT A 



S 
E 





c c c 








L 








L 0 0 P P 


ss 






M 




S S 




K N N M 


E E 






S 




BEE R 




MFF P P 


L L 


R R 


F EE S 


/ 




Y L L E H 


I 


0 II EE 


0 0 


NN 


ANNR 




OOOTSS SATT 


11 


DGGSS 


PP 


D D 


SRRC 


L 


EEE 


E T T E L PP 


0-0 


E 1-0 2-0 


7-0 


1-0 


T A B C 


SYCS 


P 1-0 T T 1-0 



010 0100 OOOx 0 01 010 1111 1111 00 0 1 1 x x 0 0 0 x 11 1 1 11 



1 2 3 4 5 6 7 8 9 10 11 1213 14 1516 17 18 19 20 



CLK 



INST 




I iw-thl. 
' ^ul 



UNDETERMINED 



UNDETERMINED 



Figure 44. Integer Square Root 
(P1PES2-PIPES0 = 010) 
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The final example shows SQRT A with all registers enabled. The next instruction may 
be loaded on the nineteenth rising clock edge, but the output will not be available until 
after the twentieth rising clock. 

CLKMODE = 0 PIPES = 000 Operation: SQRT A 
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Figure 45. Integer Square Root 
(PIPES2-PIPES0 - 000) 
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Chained Multiplier/ALU Operations 

Simultaneous multiplier and ALU functions can be selected in chained mode to support 
calculation of sums of products or products of sums. Operations selectable in chained 
mode overlap partially with those selectable in independent multiplier or ALU operating 
mode. Format conversions, absolute values, and wrapping or unwrapping of denormal 
numbers are not available in chained mode. 

To calculate sums of products, the FPU can operate on external data inputs in the 
multiplier while the ALU operates on feedback from the previous calculation. The 
operand selects SELOPS7-SELOPS0 can be set to select multiplier inputs from the 
RA and RB registers and ALU inputs from the P and S registers. 

The sample microinstruction sequence shown in Tables 36 and 37 performs the 
operations for multiplying sets of data operands and accumulating the results, the basic 
operations involved in computing a sum of products. 

Table 36 represents the operations, clock cycles, and register contents for a single- 
precision sum of four products. Registers used include the RA and RB input registers 
and the product (P) and sum (S) registers. 



Table 36. Single-Precision Sum of Products (PIPES2-PIPES0 = 010) 



CLOCK 


MULTIPLIER/ALU 


PSEUDOCODE 


CYCLE 


OPERATIONS 


1 


Load A, B 
A * B 


A -* RA, B RB 


2 


Pass P(AB) to S 


C-*RA, D^RB 




Load C, D 




C * D 


A * B P(AB) 


3 


S(AB) + P(CD) 


P(AB) + 0 ~* S(AB) 




Load E, F 


E - RA, F - RB 




E * F 


C * D - P(CD) 


4 


S(AB + CD) + P(EF) 


S(AB) + P(CD)-*S(AB + CD) 




Load G, H 


G - RA, H ■* RB 




G * H 


E * F- P(EF) 


5 


S(AB + CD) + EF) + P(GH) 


S(AB + CD) + P(EF) - S(AB + CD .+ EF) 
G * H-^ P(GH) 


6 


New Instruction 


S(AB + CD + EF) + P(GH) ~* S(AB + CD + EF + GH) 



A microcode sequence to generate this sum of product is shown in Table 37. Only 
three instructions in chained mode are required, since the multiplier begins the 
calculation independently and the ALU completes it independently. 




oo 
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Table 37. Sample Microinstructions for Single-Precision Sum of Products 



S 
E 

CCC L 





L 


0 0 


P P 


S S 












M 










S S 










K 


N N 


I I 


E E 












S 








B 


E E 


R 








M 


F F 


P P 


L L 


R R 


F 


E 


E 


S 


/ 








Y 


L L 


E 


H 






0 


I I 


E E 


0 0 


N N 


A 


N 


N 


R 




0 


0 


0 


T 


SS 


S 


A 


T 1 


1 1 


D 


G G 


S S 


PP 


D D 


S 


R 


R 


C 


L 


E 


E 


E 


E 


T T 


E 


L 


P F 


9-0 


E 


1-0 


2-0 


7-0 


1-0 


T 


A 


B 


C 


S 


Y 


C 


S 


P 


1-0 


T 


T 


1-< 


00 0100 0000 


0 


01 


010 


1111 XXXX 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 


1 


1 1 


10 0110 0000 


0 


01 


010 


1111 xxxx 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 


1 


11 


10 0000 0000 


0 


01 


010 


1111 1010 


00 


0 


1 


1 


X 


X 


X 


X 


x 


X 


XX 


1 


1 


11 


10 0000 0000 


0 


01 


010 


xxxx 1010 


00 


0 


1 


1 


X 


X 


X 


X 


X 


X 


XX 


1 


1 


11 


00 0000 0000 


0 


01 


010 


xxxx 1010 


00 


0 


X 


X 


X 


X 


X 


X 


X 


X 


XX 


1 


1 


11 


XX xxxx xxxx 


x 


xx 


XXX 


xxxx xxxx 


XX 


X 


X 


X 


X 


X 


0 


0 


0 


X 


XX 


1 


1 


11 



Fully Pipelined Double-Precision Operations 

Performing fully pipelined double-precision operations requires a detailed understanding 
of timing constraints imposed by the multiplier. In particular, sum of products and 
product of sums operations can be executed very quickly, mostly in chained mode, 
assuming that timing relationships between the ALU and the multiplier are coded 
properly. 

Pseudocode tables for these sequences are provided, (Table 38 and Table 39) showing 
how data and instructions are input in relation to the system clock. The overall patterns 
of calculations for an extended sum of products and an extended product of sums 
are presented. These examples assume FPU operation in CLKMODE 0, with the CONFIG 
setting HL to load operands by MSH and LSH, all registers enabled 
(PIPES2 - PIPES0 = LLL), and the C register clock tied to the system clock. 

PV in the sum of products timing table, the two initial products are generated in 
WM independent multiplier mode. Several timing relationships should be noted in the table. 
^ m The first chained instruction loads and begins to execute following the sixth rising 
(/> edge of the clock, after the first product PI has already been held in the P register 
2 for one clock. For this reason, P1 is loaded into the C register so that P1 will be stable 
for two clocks. 

> 

O On the seventh clock, the ALU pipeline register loads with an unwanted sum, P1 + P1. 
^ However, because the ALU timing is constrained by the multiplier, the S register will 
00 not load until the rising edge of CLK9, when the ALU pipe contains the desired sum, 
^ P1 +. P2. The remaining sequence of chained operations then execute in the desired 
manner. 
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Table 38. Pseudocode for Fully Pipelined Double-Precision Sum of Products* 
{CLKM = 0, CONFIG = 10, PIPES -000, CLKC^SYSCLK) 



CLK 


DA 
BUS 


DB 
BUS 


TEMP 
REG 


INS 
BUS 


INS 

REG 


RA 

REG 


RB 
REG 


MUL 
PIPE 


P 
REG 


C 
REG 


ALU 
PIPE 


s 

REG 


Y 
BUS 


J 1 


A1 MSH 


B1 MSH 


A1 ,B1 MSH 


A1 * B1 




















_T2 


A1 LSH 


B1 LSH 


A1,B1MSH 


A1 *B1 


A1 *B1 


At 


B1 














r~ o 

_J 3 


A2 MSH 


B2 MSH 


A2,B2MSH 


A2 * B2 


A1 * B1 


A1 


B1 


At * B1 












_T4 


A2 LSH 


B2 LSH 


A2,B2MSH 


A2*B2 


A2 * B2 


A2 


B2 


A1 *B1 












_Tb 


A3 MSH 


B3 MSH 


A3,B3MSH 


PR + CR 
A3*B3 


A2*B2 


A2 


B2 


A2*B2 


P1 










J~6 


A3 LSH 


B3 LSH 


A3,B3MSH 


PR + CR 
A3*B3 


PR + CR, 
A3*B3 


A3 


B3 


A2*B2 


P1 


P1 








J"7 


A4 MSH 


B4 MSH 


A4,B4MSH 


PR + SR 
A4*B4 


PR + SR, 
A3*B3 


A3 


B3 


A3*B3 


P2 


P1 


P1 +P1 






J~8 


A4 LSH 


B4 LSH 


A4,B4MSH 


PR + SR 
A4*B4 


PR + SR, 
A4*B4 


A4 


B4 


A3 * B3 


P2 


P1 


P1 +P2 






J~9 


A5 MSH 


B5 MSH 


A5,B5MSH 


PR + SR 
A5*B5 


PR + SR, 
A4*B4 


A4 


B4 


A4*B4 


P3 


P2 


St + P2 


S1 




J~io 


A5 LSH 


B5 LSH 


A5,B5MSH 


PR + SR 
A5*B5 


PR + SR, 
A5*B5 


A5 


B5 


A4*B4 


P3 


P3 


S1 +P3 


S1 




J~11 


A6 MSH 


B6 MSH 


A6,B6(M) 


PR + SR 
A6*B6 


PR + SR, 
A5*B5 


A5 


B5 


A5*B5 


P4 


P3 


XXXXX 


S2 




J"12 





























tpR = Product Register 
SR = Sum Register 
CR = Constant (C) Register 
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Table 39. Pseudocode tor Fully Pipelined Double-Precision Product of Sums * 
(CLKM = 0, CONFIG = 10, PIPES = 000, CLKC^SYSCLK) 



CLK 


DA 
BUS 


DB 
BUS 


TEMP 
REG 


INS 
BUS 


INS 
REG 


RA 
REG 


RB 
REG 


MUL 
PIPE 


P 
REG 


c 

REG 


ALU 
PIPE 


S 
REG 


Y 
BUS 


J 1 


A1 (M) 


B1(M) 


A1.BKM) 


A1 + B1 




















J~2 


AKL) 


B1(L) 


A1.BKM) 


A1 +B1 


A1 +B1 


A1 


B1 














J~3 


A2(M) 


B2(M) 


A2,B2(M) 


A2 + B2 


A1 +B1 


A1 


B1 








A1.+ B1 






_T4 


A2(L) 


B2(L) 


A2,B2(M) 


A2 + B2 


A2 + B2 


A2 


B2 








A1 + B1 


S1 




_T5 


A3(M) 


B3(M) 


A3,B3(M) 


CR*SR 
A3 + B3 


A2 + B2 


A2 


B2 






ENRC = L 
S1 


A2 + B2 


S1 




J~6 


A3(L) 


B3(L) 


A3,B3(M) 


CR*SR 
A3 + B3 


CR*SR 
A3 + B3 


A3 


B3 






S1 


A2 + B2 


S2 




_T? 


XXX 


XXX 


XXX 


NOP 


CR*SR 
A3 + B3 


A3 


B3 


S1 *S2 




S1 


A3 + B3 


S2 




J8 


A4(M) 


B4(M) 


A4,B4(M) 


PR*SR 
A4 + B4 


NOP 


ENRA = L 
A3 


ENRB = L 
B3 


Si *S2 




S1 


A3 + B3 


XXX 




J~9 


A4(L) 


B4(L) 


A4,B4(M) 


PR*SR 
A4 + B4 


PR * SR 
A4 + B4 


A4 


B4 


XXX 


P1 


S1 


XXX 


S3 




_TlO 


XXX 


XXX 


XXX 


NOP 


PR*SR 
A4 + B4 


A4 


B4 


P1 * S3 


P1 


S1 


A4 + B4 


S3 




J"" 


A5(M) 


B5(M) 


A5,B5(M) 


PR*SR 
A5 + B5 


NOP 


ENRA = L 
A4 


ENRB = L 
B4 


P1 *S3 


XXX 


S1 


A4 + B4 


XXX 




J~12 


A5(L) 


B5(L) 


A5,B5(M) 


PR * SR 
A5 + B5 


PR*SR 
A5 + B5 


A5 


B5 


XXX 


P2 


S1 


X 


S4 





NOTE: NOP instruction is 01 1 0000 0000. 
1 PR = Product Register 

SR = Sum Register 

CR = Constant (C) Register 



In the product of sums timing table, the two initial sums are generated in independent 
ALU mode. The remaining operations are shown as alternating chained operations 
followed by NOPs (no operations). The NOPs are necessary to provide an extra cycle 
during which the multiplier outputs the current intermediate product. The current sum 
and the latest intermediate product are then fed back to the multiplier inputs for the 
next chained operations. In this manner a double-precision product of sums is generated 
in three system clocks, as opposed to two clocks for a double-precision sum of 
products. 

Mixed Operations and Operands 

Using mixed-precision data operands or performing sequences of mixed operations 
may require adjustments in timing, operand precision, and control settings. To simplify 
microcoding sequences involving mixed operations, mixed-precision operands, or both, 
it is useful to understand several specific requirements for mixed-mode or mixed- 
precision processing. 

Calculations involving mixed-precision operands must be performed as double-precision 
operations. The instruction settings (18-17) should be set to indicate the precision of 
each operand from the RA and RB input registers. (Feedback operands from internal 
registers are also double-precision.) Mixed-precision operations should not be performed 
in chained mode. 

Timing for operations with mixed-precision operands is the same as for a corresponding 
double-precision operation. In a mixed-precision operation, the single-precision operand 
must be loaded into the upper half of its input register. 

Most format conversions also involve double-precision timing. Conversions between 
single- and double-precision floating point format are treated as mixed-precision 
operations. During integer to floating point conversions, the integer input should be 
loaded into the upper half of the RA register. 

In applications where mixed-precision operations is not required, it is possible to tie 
the 18-17 instruction inputs together so that both controls always select the same 
precision. 

Sequences of mixed operations may require changes in multiple control settings to 

deal with changes in timing of input, execution, and output of results. Figure 46 shows 

a simplified timing waveform for a series of mixed operations: ^ 

00 
h- 

< 

rv 
2 

■ ■ ' (/> 
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1 2 3 4 5 6 7 8 9 10 11 12 13 



FUNCTION 
AND DATA 


A,B 


A B 


CD 


CD 




E,F 


G,H 


G,H 


U 


U 


K,L 


M,N 




RESULTS 
AND STATUS 






XXXX 


A,B 


XXXX 


CD 


E,F 


E,F 


G,H 


G,H 


U 


K,L 


M,N 



A,B,C,D — double precision multiply; E,F — single precision operation; G,H,I,J — double 
precision add; K,L — single precision opration. A double precision number is not required to 
be held on the outputs for two cycles unless it is followed by a like double precision function. 
If a double precision multiply is followed by single precision operation, there must be one open 
clock cycle. 

Figure 46. Mixed Operations and Operands 
(PIPES2-PIPES0 =110, CLKMODE = 0) 



In this sequence, the fifth cycle is left open because a single-precision multiply follows 
a double-precision multiply. If the SP multiply were input during the period following 
the fourth rising clock edge, the result of the preceding operation would be overwritten, 
since an SP multiply executes in one clock cycle. To avoid such a condition, the FPU 
will not load during the required open cycle. 

Because the sequence of mixed operations places constraints on output timing, only 
one cycle is available to output the double-precision (G * D) result. By contrast/the 
SP multiply (E * F) is available for two cycles because the operation which follows 
it does not output a result in the period following the seventh rising clock edge. In 
general, the precision and timing of each operation affects the timing of adjacent 
operations. 

Control settings for CLKMODE and registers must also be considered in relation to 
precision and speed of execution. In Figure 47, a similar sequence of mixed operations 
is set up for execution in fully pipelined mode: 

! CLOCK CYCLE 



2 

> 

O 
H 
00 
00 

*4 



FUNCTION 
AND DATA 

RESULTS 
AND STATUS 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


A B 




CD 




E,F 


G,H 


U 


K,L 


M,N 


CP 




Q,R 












A,B 


A,B 


CD 


E,F 


G,H 


U 


K,L 


M,N 


M,N 



A,B,C,D — double precision multiply; E,F — single precision operation; G,H, — double precision 
add; l,J,K,L,M,N — single precision operation; 0,P,Q,R — double precision multiply. In clock 
mode 1, a double precision result is two cycles long only when a double precision multiply is 
followed by a double precision multiply. 

Figure 47. Mixed Operations and Operands 
(PIPES2-PIPES0 - 000, CLKMODE =1) 
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Although the data operands can be loaded in one clock cycle with CLKMODE set high, 
enabling two additional internal registers delays the (A * B) result one cycle beyond 
the previous example. Again, an open cycle is required after the (C * D) operation 
because the next operation is single precision. The result of the (C * D) multiply is 
available for one cycle instead of two, also because the following operation is single 
precision. With this setting of CLKMODE and PIPES2-PIPES0, a double-precision result 
is only available for two clock cycles when one DP multiply follows another DP multiply. 



Matrix Operations 

The 'ACT8847 floating point unit can also be used to perform matrix manipulations 
involved in graphics processing or digital signal processing. The FPU multiplies and 
adds data elements, executing sequences of microprogrammed calculations to form 
new matrices. 

Representation of Variables 

In state representations of control systems, an n-th order linear differential equation 
with constant coefficients can be represented as a sequence of n first-order linear 
differential equations expressed in terms of state variables: 



dx1 
dt 



x 2, 



dx(n-1) 
dt 



xn 



For example, in vector-matrix form the equations of an nth-order system can be 
represented as follows: 





x1 






x2 




d 






dt 








xn 





a11 


a12 . . 


a1n 


an1 


an2 . . 


ann 





x1 






x2 


+ 




xn 





b11 . 


. . bin 


bn1 . 


bnn 



un 



or, X = ax + bu 

Expanding the matrix equation for one state variable, dx1/dt, results in the following I s * 
expression: qq 

00 

X1 = (all * x1 + . . . + a1n * xn) + (b1 1 * uT + . . . + bin * un) Q 

< 

where X1 = dx1/dt. <tf 




CO 
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Sequences of multiplications and additions are required when such state space 
transformations are performed, and the 'ACT8847 has been designed to support such 
sum-of-products operations. An n x n matrix A multiplied by an n x n matrix X yields 
an n x n matrix C whose elements cij are given by this equation: 



n 

cij = ^ aik * xkj for i = 1 , 
k=1 



,n j = 1, 



(1) 



For the cij elements to be calculated by the 'ACT8847, the corresponding elements 
aik and xkj must be stored outside the 'ACT8847 and fed to the 'ACT8847 in the 
proper order required to effect a matrix multiplication such as the state space system 
representation just discussed. 

Sample Matrix Transformation 

The matrix manipulations commonly performed in graphics systems can be regarded 
as geometrical transformations of graphic objects. A matrix operation on another matrix 
representing a graphic object may result in scaling, rotating, transforming, distorting, 
or generating a perspective view of the image. By performing a matrix operation on 
the position vectors which define the vertices of an image surface, the shape and 
position of the surface can be manipulated. 

The generalized 4x4 matrix for transforming a three-dimensional object with 
homogeneous coordinates is shown below: 



a 


b 


c 


d 


e 


f 


g 


h 


i 


j 


k 


I 


m 


n 


0 


P 



The matrix T can be partitioned into four component matrices, each of which produces 
' a specific effect on the resultant image: 



z 

> 

o 

H 
00 
00 





3 


3x3 


X 




1 


1 x 3 


1 X 1 
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The 3x3 matrix produces linear transformation in the form of scaling, shearing and 
rotation. The 1 x 3 row matrix produces translation, while the 3 x 1 column matrix 
produces perspective transformation with multiple vanishing points. The final single 
element 1 x 1 produces overall scaling. Overall operation of the transformation matrix 
T on the position vectors of a graphic object produces a combination of shearing, 
rotation, reflection, translation, perspective, and overall scaling. 

The rotation of an object about an arbitrary axis in a three-dimensional space can be 
carried out by first translating the object such that the desired axis of rotation passes 
through the origin of the coordinate system, then rotating the object about the axis 
through the origin, and finally translating the rotated object such that the axis of rotation 
resumes its initial position. If the axis of rotation passes through the point P = [a b c 1 ], 
then the transformation matrix is representable in this form: 



Ixyz h] = [x y z 1 j 



1 


0 


0 


0 


0 


1 


0 


0 


0 


0 


1 


0 


- a 


-b 


- c 


1 



1 


0 


0 


0 


0 


1 


0 


0 


0 


0 


1 


0 


a 


b 


c 


1 



(2) 



translation rotation translation 
to origin about back to initial 

origin position 




oo 
oo 
h- 
o 
< 

Z 
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where R may be expressed as: 





n1 2 + (1-n) 2 cos</> 


n1n2(1-cos</>) + n3sin0 


n1 n3(1-cos(/>) -n2sin$ 


0 


R = 


n 1 n2( 1 -cos<£) - n3sin<£ 


n2 2 + (1-n2) 2 cos(/> 


n2n3(1-cos(/>) + n1sin<£ 


0 




n1n3(1-cos0) + n2sin0 


n2n3(1 ~cos</>) -n1sin(/> 


n3 2 + (1-n3) 2 cos</> 


0 




0 


0 


0 


1 



and n1 = q1/(q1 2 + q2 2 + q3 2 )1/ 2 = direction cosine for x-axis of 

rotation 

n2 = q2/(q1 2 + q2 2 + q3 2 )1/ 2 = direction cosine for y-axis of rotation 
n3 = q3/(q1 2 + q2 2 + q3 2 p/ 2 = direction cosine for z-axis of rotation 
n = (n1 n2 n3) = unit vector for Q 

Q = vector defining axis of rotation = [q1 q2 q3] 
<t> = the rotation angle about Q 




z 

> 
o 

H 
00 
00 



A general rotation using equation (2) is effected by determining the [x y z] coordinates 
of a point A to be rotated on the object, the direction cosines of the axis of rotation 
[n1,-n2, n3], and the angle 0 of rotation about the axis, all of which are needed to 
define matrix [R]. Suppose, for example, that a tetrahedron ABCD, represented by 
the coordinate matrix below is to be rotated about an axis of rotation RX which passes 
through a point P = [5 -631] and whose direction cosines are given by unit vector 
[n1 = 0.866, n2 = 0.5, n3 = 0.707]. The angle of rotation 0 is 90 degrees (see 
Figure 24). The rotation matrix [R] becomes 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



A 
B 
C 
D 



R = 



0.750 


1.140 


0.112 


0 


-0.274 


0.250 


1.220 


0 


1.112 


-0.513 


0.500 


0 


0 


0 


0 


1 
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(1) THIS ARROW DEPICTS THE FIRST TRANSLATION 

{2) THIS AROW DEPICTS THE 90° ROTATION 

(3) THIS ARROW DEPICTS THE BACK TRANSLATION 



Figure 48. Sequence of Matrix Operations 



The point transformation equation (2) can be expanded to include all the vertices of 
the tetrahedron as follows: 



xa 


ya 


za 


hi 


xb 


Vb 


zb 


h2 


xc 


yc 


zc 


h3 


xd 


yd 


zd 


h4 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



10 00 
0 1 00 
00 10 

-5 6-3 1 



translation 
to origin 



0.750 1.140 0.112 0 
-0.274 0.250 1.22 0 
1.112 -0.513 0.500 0 
0 0 0 1 



rotation about origin 



1 


0 


0 0 


0 


1 


0 0 


0 


0 


1 0 


5- 


6 


3 1 



translation 
back to 

initial 
position 




(3) 



oo 
oo 
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< 

Z 
to 
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The 'ACT8847 floating-point unit can perform matrix manipulation involving 
multiplications and additions such as those represented by equation (1). The matrix 
equation (3) can be solved by using the 'ACT8847 to compute, as a first step, the 
product matrix of the coordinate matrix and the first translation matrix of the right- 
hand side of equation (3) in that order. The second step involves postmultiplying the 
rotation matrix by the product matrix. The third step implements the back-translation 
by premultiplying the matrix result from the second step by the second translation 
matrix of equation (3). Details of the procedure to produce a three-dimensional rotation 
about an arbitrary axis are explained in the following steps: 

Step 1 

Translate the tetrahedron so that the axis of rotation passes through the origin. This 
process can be accomplished by multiplying the coordinate matrix by the translation 
matrix as follows: 



2 


-3 


3 


1 


1 


-2 


2 


1 


2 


-1 


2 


1 


2 


-2 


2 


1 



1 


0 


0 


0 




(2-5) 


(-3 + 6) 


(3-3) 


1 


0 


1 


0 


0 




(1-5) 


(-2 + 6) 


(2-3) 


1 


0 


0 


1 


0 




(2-5) 


(-1+6) 


(2-3) 


1 


-5 


6 


-3 


1 




(2-5) 


(-2 + 6) 


(2-3) 


1 



translation 
to origin 



vertices of translated 
tetrahedron 




-3 


+ 3 


0 


1 


-4 


+ 4 


-1 


1 


-3 


+ 5 


-1 


1 


-3 


+ 4 


-1 


1 



AT 
BT 
CT 
DT 



z 
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The 'ACT8847 could compute the translated coordinates AT, BT, CT, DT as indicated 
above. However, an alternative method resulting in a more compact solution is 
presented below. 

Step 2 

Rotate the tetrahedron about the axis of rotation which passes through the origin after 
the translation of Step 1 . To implement the rotation of the tetrahedron, postmultiply 
the rotation matrix [R] by the translated coordinate matrix from Step 1 . The resultant 
matrix represents the rotated coordinates of the tetrahedron about the origin as follows: 



-3 


3 


0 


1 




0.750 


1.140 


0.112 


0 




-3.072 


-2.670 


3.324 


1 


-4 


4 


-1 


1 




-0.274 


0.250 


1.22 


0 




-5.208 


-3.047 


3.932 


1 


-3 


5 


-1 


1 




1.112 


-0.513 


0.500 


0 




-4.732 


-1.657 


5.264 


1 


-3 


4 


-1 


1 




0 


0 


0 


1 




-4.458 


-1.907 


4.044 


1 



rotation about origin 



rotated coordinates 



Step 3 

Translate the rotated tetrahedron back to the original coordinate space. This is done 
by premultiplying the resultant matrix of Step 2 by the translation matrix. The following 
calculations produces the final coordinate matrix of the transformed object: 
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A more compact solution to these transformation matrices is a product matrix that 
combines the two translation matrices and the rotation matrix in the order shown in 
equation (3). Equation (3) will then take the following form: 



xa ya za hi 
xb yb zb h2 
xc yc zc h3 
xd yd zd h4 
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The newly transformed coordinates resulting from the postmultiplication of the 
transformation matrix by the coordinate matrix of the tetrahedron can be computed 
using equation (1) which was cited previously: 



n 

cij = aik * xkj for i = 1 , 
k = 1 



,n j = 1, . 



(1) 



For example, the coordinates may be computed as follows: 



xa = c11 = a11 * x11 + a12 * x21 + a13 * x31 + a14 * x41 

= 2 * 0.750 + (-3) * (-0.274) + 3 * 1.112 + 1 * (-3.73) 

= 1.5 + 0.822 + 3.336 - 3.73 

= 1.928 



ya = c12 = all * x12 + a12 * x22 + a13 * x32 + a14 * x42 

= 2 * 1.140 + (-3) * 0.250 + 3 * (-0.513) + 1x(-8.661) 

= 2.28 -0.75 - 1.539 - 8.661 

= -8.67 



za = c13 = al 1 * x13 + a12 * x23 +. a13 * x33 + a14 * x43 

= 2 * 0.112 + (-3) * 1.220 + 3 * 0.500 + 1 * 8.260 

= 0.224 - 3.66 + 1.5 + 8.260 

= 6.324 



hi = c14 = a1 1 * x14 + a12 * x24 + a13 * x34 + a14 * x44 

= 2*0 + (-3)* 0 + 3*0 + 1*1 

= 0 + 0 + 0 + 1 

= 1 

► A' = [1.928 - 8.67 6.324 1] 

The other rotated vertices are computed in a similar manner: 

B' = [-5.208 -3.047 3.932 1] £ 
C = [-4.732 -1.657 5.264 1) & 
D' =[-4.458 -1.907 4.044 1] CO 

O 

Microinstructions for Sample Matrix Manipulation ^ 

The 'ACT8847 FPU can compute the coordinates for graphic objects ov6r a broad 2 
dynamic range. Also, the homogeneous scalar factors hi , h2, h3 and h4 may be made C/) 
unity due to the availability of large dynamic range. In the example presented below, 
some of the calculations pertaining to vertex A' are shown but the same approach 
can be applied to any number of points and any vector space. 
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The calculations below show the sequence of operations for generating two 
coordinates, xa and ya, of the vertex A' after rotation. The same sequence could be 
continued to generate the remaining two coordinates for A' (za and hi). The other 
vertices of the tetrahedron, B', C\ and D\ can be calculated in a similar way. 

A microcode sequence to generate this matrix multiplication is shown in Table 40. 
Table 41 presents a pseudocode description of the operations, clock cycles, and register 
contents for a single-precision matrix multiplication using the sum-of-products sequence 
presented in an earlier section/Registers used include the RA and RB input registers 
and the product (P) and sum (S) registers. 

Table 40. Microinstructions for Sample Matrix Multiplication 
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Six cycles are required to complete calculation of xa, the first coordinate, and after 
four more cycles the second coordinate ya is output. Each subsequent coordinate can 
be calculated in four cycles so the 4-tuple for vertex A' requires a total of 18 cycles 
to complete. 

Calculations for vertices B', C, and D', can be executed in 48 cycles, 16 cycles for 
each vertex. Processing time improves when the transformation matrix is reduced, 
i.e., when the last column has the form shown below: 



0 
0 
0 
1 
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Table 41. Single-Precision Matrix Multiplication (PIPES2-PIPES0 = 010) 



CLOCK 


MULTIPLIER/ALU 


PSEUDOCODE 


CYCLE 


OPERATIONS 


1 


Load a1 1 , x1 1 


a11 - RA, x11 ^RB 




SP Multiply 


p1 = all ... * x11 


2 


Load a 12, x21 


a12 -*RA, x21 -*RB 




SP Multiply 


p2 = a12 * x21 




Pass P to S 


pi - P(pD 


3 


Load a 13, x31 


a 13 RA, x31 -*RB 




SP Multiply 


p3 = a13 * x31, p2 -»P(p2) 




Add Pto S 


P(p1) + 0- S(p1) 


4 


Load a 14, x41 


a14 — RA, x41 - RB 




SP Multiply 


P 4 = a 14 * x41, p3 — Pfp3) 




Add P to S 


P(p2) + S(p1)-S(p1 + p2) 


5 


Load a1 1, x12 


911 - RA, x12 - RB 




SP Multiply 


p5 = a11 * x12, p4 - P(p4) 




Add P to S 


P(p3) + S(p1 + p2) - S(p1 + p2 + p3) 
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I nari fll 9 x?? 

t—U CIVJ CI 1 4b. / Ak.4L.4L~ 


a1 2 -> RA x22 -* RB 




SP Multiply 


p6 = a12 * x22, p5 P(p5) 




Pass P to S 


P(p4) + S(p1 + p2 + p3) - 




Output S 


S(p1 + p2 + p3 + p4) 


7 


Load a13, x32 


a13 -*RA, x32~* RB 




SP Multiply 


p7 = a13 * x32, p6 - P(p6) 




Add P to S 


P(p5) 4- 0 - S(p5) 


8 


Load a 14, x42 


a14-^RA, x42 -+RB 




SP Multiply 


p8 = a14 * x42, p7 - P(p7) 




Add P to S 


P(p6) + S(p5)- S(p5 + p6) 


9 


Next operands 


A - RA, B -* RB 




Next instruction 


pi = A * B, p8 P(p8) 




Add P to S 


P(p7) + S(p5 + p6) - S(p5 + p6 .+ p7) 


10 


Next operands 


C - RA, D - RB 




Next instruction 


pj = C * D, pi - P(pi) 




Output S 


P(p8) + S(p5 + p6 + p7) -* 

S(p5 + p6 + p7 + p8) 



m 



The h-scalars 111,112, h3, and h4 are equal to 1 . The number of clock cycles to generate 
each 4-tuple can then be decreased from 1 6 to 1 3 cycles. Total number of clock cycles qq 
to calculate all four vertices is reduced from 66 to 54 clocks. Figure 49 summarizes 00 
the overall matrix transformation. q 

< 

r** 
Z 
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Figure 49. Resultant Matrix Transformation 

This microprogram can also be written to calculate sums of products with all pipeline 
registers enabled so that the FPU can operate in its fastest mode. Because of timing 
relationships, the C register is used in some steps to hold the intermediate sum of 
products. Latency due to pipelining and chained data manipulation is 11 cycles for 
calculation of the first coordinate, and four cycles each for the other three coordinates. 

After calculation of the first vertex, 16 cycles are required to calculate the four 




coordinates of each subsequent vertex. Table 42 presents the sequence of calculations 
for the first two coordinates, xa and ya. 
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Table 42. Fully Pipelined Sum of Products (PIPES2-PIPES0 - 000) 
(Bus or Register Contents Following Each Rising Clock Edge) 
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t Contents of this register are not valid during this cycle. 

Products in Table 42 are numbered according to the clock cycle in which the operands 
and instruction were loaded into the RA, RB, and I register, and execution of the 
instruction began. Sums indicated in Table 42 are listed below: 



s1 = p1 + 0 
s2 = p1 + p3 
s3 = p2 + p4 
S 4 = p5 + 0 



s5 = p5 + p7 

s6 = p6 + p8 

s7 = p9 + 0 

s8 = p9 + p1 1 



s9 = p10 + p12 

xA = p1 + p2 + p3 + p4 

yA = p5 + p6 + p7 ■+ p8 
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GLOSSARY 



Biased exponent — The true exponent of a floating-point number plus a constant called 
the exponent field's excess. In IEEE data format, the excess or bias is 127 for single- 
precision numbers and 1023 for double-precision numbers. 

Denormalized number (denorm) — A number with an exponent equal to zero and a 
nonzero fraction field, with the implicit leading (leftmost) bit of the fraction field being 0. 

NaN (not a number) — Data that has no mathematical value. The 'ACT8847 produces 
a NaN whenever an invalid operation such as 0 * oo is executed. The output format 
for an NaN is an exponent field of all ones, a fraction field of all ones, and a zero sign 
bit. Any number with an exponent of all ones and a nonzero fraction is treated as a 
NaN on input. 

Normalized number — A number in which the exponent field is between 1 and 254 
(single precision) or 1 and 2046 (double precision). The implicit leading bit is 1. 

Wrapped number — A number created by normalizing a denormalized number's fraction 
field and subtracting from the exponent the number of shift positions required to do 
so. The exponent is encoded as a two's complement negative number. 




7-142 



SN74ACT8818 16-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 



SIM74ACT8836 32- x 32-Bit Parallel Multiplier 



SN74ACT8837 64-Bit Floating Point Processor 



SN74ACT8841 Digital Crossbar Switch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 
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Design Support for Tl's SN74ACT8800 Family 

Tl's '8800 32-bit processor family is supported by a variety of tools developed to aid 
in design evaluation and verification. These tools will streamline all stages of the design 
process, from assessing the operation and performance of an individual device to 
evaluating a total system application. The tools include functional models, behavioral 
models, microcode development software, as well as the expertise of Tl's VLSI Logic 
applications group. 

Functional Evaluation Models Aid in Device Evaluation 

Many design decisions can easily be made and evaluated before hardware or board 
prototypes are needed, using functional evaluation software models. The result is 
shortened design cycles and lower design costs. 

Texas Instruments offers functional evaluation models for many of the devices in the 
'8800 family. These models are written in Microsoft C® and can be used in stand- 
alone mode or as callable functions. 

These models are designed to provide insight into the operation of the devices by 
allowing the designer to write microcode and run it through the model. This allows 
the designer to select the device that best executes a specific application and provides 
a head start in evaluating programming performance. 

The models correctly represent device timing in clock cycles, measured from the input 
of control and data to the output of results and status. Hence, initial performance 
estimates for a particular design can be made by relating the number of clock cycles 
required for an operation to the typical ac timing data for the device. 

Behavioral Simulation Models Simplify System Debugging 

System simulation with behavioral models can further shorten design time and ease 
design effort. The behavioral simulation models that support Tl's '8800 chip set have 
the timing-control and error-handling capability to perform thorough PCB and system 
simulation. These models decrease the time spent in debugging and reduce the number 
of required prototype runs. 

Users of system simulation models report a reduction by more than half in the number 
of prototype runs typically required to produce the highest-quality system. This savings 
in time reduces costs and gets the product to market as much as several months earlier 
than could be done using traditional methods. 



Microsoft C is a registered trademark of Microsoft Corporation 
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Behavioral models for Tl's '8800 family are written at the functional behavioral level 
and, therefore, are faster and easier to use and take up less disk space than some 
other types of simulation models. This higher efficiency means a simulation run can 
include more IC models and yet require less CPU time than an equivalent simulation 
using other types of models. 

These behavioral simulation models also provide explicit error messages that can help 
in the debugging process. For example, if a design violates a device set-up time, the 
model explains, via an error message, what type of violation occurred, at what point 
it occurred in the simulation run, and specifically which part's set-up time was violated. 
Then, the model continues on with the run as if no violation occurred, saving time 
rather than crashing the run at every error. 

In other words, an expert debugger is built right into the simulation. 

The models are available with commercial and military timing and interact with a variety 
of simulators. 

Behavioral Models for Tl's '8800 Family are Easily Obtained 

Texas Instruments has been working closely with both Quadtree Software Corporation 
and Logic Automation Incorporated to produce software behavioral simulation models 
of many of its VLSI devices. Since accuracy is key to solving design problems, we've 
provided Quadtree and Logic Automation with test patterns for most of our devices 
to ensure each model passes the same set of test vectors as does the actual silicon 
device. 

Quadtree offers a library of Designer's Choice™ full-functional behavioral models of 
Texas Instruments '8800 32-bit processor building block devices. 

Logic Automation Smartmodel™ library contains many Texas Instruments products, 
including devices from the '8800 chip set. 

These companies may be contacted directly at the addresses below. General 
information about behavioral model support for the '8800 family may be obtained by 
calling Texas Instruments at (214) 997-5402. 



LOGIC AUTOMATION INCORPORATED 
P.O. Box 310 
Beaverton, OR 97075 
(503) 690-6900 



QUADTREE SOFTWARE CORPORATION 
1170 Route 22 East 
Bridgewater, NJ 08807 
(201)725-2272 



Quadtree and Designer's Choice are trademarks of Quadtree Software Corporation 
Logic Automation and Smartmodel are trademarks of Logic Automation Incorporated 
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'8800 SDB Kit 



Tl offers an '8800 Software Development Board (SDB) kit as an evaluation and training 
tool. The '8800 SDB kit allows users to evaluate performance and write microprograms 
for several of the '8800 building blocks, using a range of software development tools. 

Built on a PC/AT card occupying a single slot, the '8800 SDB includes an ' ACT88 1 8 
microsequencer, 'ACT8832 registered ALU, and an 'ACT8847 floating point unit, along 
with 32K deep by 128 bits wide microcode memory, and 32K deep by 32 bits wide 
local data memory. Interface software is provided so that microcode and local data 
memory may be loaded and read from the PC/AT bus. Documentation is also included 
in the kit. 

Users may write microcode source with the Tl Meta Assembler, Metastep™ Assembler, 
or Hi-Level software tools. Pre-built '8800 SDB definition files are available with each 
of these so that users can begin developing microcode as soon as possible. 

A software simulation of the '8800 SDB is currently being developed. The simulator 
operates in an MS-DOS environment. 

For additional technical information, please contact VLSI Systems Engineering at 
(214) 997-3970. For ordering information, please call your local field sales 
representative. 




O 
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o. 

3 



MetaStep is a trademark of STEP Engineering, Inc. 
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Program Code Generation Using the Tl Meta Assembler 

The Tl Meta Assembler (TIM) provides the means to create object microcode files and 
to support listings for programs that execute in architectures without standard 
instruction sets. The end-product of TIM is an absolute object code module in suitable 
format for downloading to PROM programmers or to the emulator memories of 
development systems. TIM is fully compatible with some other assemblers as well. 

Systems Expertise is a Phone Call Away 

Texas Instruments VLSI Logic applications group is available to help designers analyze 
Tl's high-performance VLSI products, such as the '8800 32-bit processor family. The 
group works directly with designers to provide ready answers to device-related 
questions and also prepares a variety of applications documentation. 

The group may be reached in Dallas, at (214) 997-3970. 
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Overview 



SN74ACT88 18 1 6-Bit Microsequencer 



SN74ACT8832 32-Bit Registered ALU 



SN74ACT8836 32- x 32-Bit Parallel Multiplier 



SN74ACT8837 64-Bit Floating Point Processor 



SN74ACT8841 Digital Crossbar Switch 



SN74ACT8847 64-Bit Floating Point/Integer Processor 
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Mechanical Data 

SN74ACT8818 - 1 1 x 1 1 GC PACKAGE 

SN74ACT8832 1 7 x 1 7 GB PACKAGE 

SN74ACT8836 1 5 x 1 5 GB PACKAGE 

SN74ACT8837 1 7 x 1 7 GB PACKAGE 

SN74ACT8841 15x15 GB PACKAGE 

SN74ACT8847 1 7 x 1 7 GA PACKAGE 
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ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
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NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051 ) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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i (See Note A) 
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ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 



2 3 4 5 6 7 8 9 10 11 



NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051 ) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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13 x 13 GB pin grid array ceramic package 



INDEX CORNER - 



35,1 (1.380) 



32,5 (1.280) 



35, 1 (1.380) 
32,5 (1.280) 
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r (See Note A) 
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ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 
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NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051 ) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 



9-7 



13 x 13 GC pin grid array ceramic package 



35,1 (1.380) 
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ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 



2 3 



5 6 7 8 9 10 11 12 13 




NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051 ) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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15 x 15 GB pin grid array ceramic package 
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DIA (4 PLACES) 
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I (See Note A) 
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ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 



1 2 3 4 5 6 7 8 9 101112 1314 15 

NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051 ) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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15x15 GC pin grid array ceramic package 
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2,54 (0.100) T P. 
(See Note A) 



ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 



1 2 3 4 5 6 7 8 9 10 1112 1314 15 

i NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051 ) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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17 x 17 GA pin grid array ceramic package 
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2.54 (0.100) T P. 
(See Note A) 



ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 



1 2 3 4 5 6 7 8 9 1011121314151617 



NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
meximum material condition and within 0,381 (0.051 ) radius relative to the center 
of the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 



9-11 



17 x 17 GB pin grid array ceramic package 
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2,54 (0.100) T.P. 
(See Note A) 



ALL POSSIBLE PIN LOCATIONS ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DATA SHEETS FOR ACTUAL PIN 
LOCATIONS USED. 



1 2 3 4 5 6 7 8 9 1011121314151617 



NOTE A: Pins are located within 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,381 (0.051) radius relative to the center of 
the ceramic. 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 



TI Sales Offices 



ALABAMA: Huntsville (205) 837-7530. 

ARIZONA: Phoenix (602) 995-1007; 
Tucson (602) 624-3276. 

CALIFORNIA: Irvine (714) 660-1200; 
Sacramento (916) 929-0197; 
San Diego (619) 278-9600; 
Santa Clara (408) 980-9000; 
Torrance (213) 217-7000; 
Woodland Hills (818) 704-7759. 

COLORADO: Aurora (303) 368-8000. 

CONNECTICUT: Wallingford (203) 269-0074. 

FLORIDA: Altamonte Springs (305) 260-21 16, 
Ft. Lauderdale (305) 973-8502; 
Tampa (813) 286-0420. 

GEORGIA: Norcross (404) 662-7900. 

ILLINOIS: Arlington Heights (312) 640-3000. 

INDIANA: Carmel (31 7) 573-6400; 
Ft. Wayne (219) 424-5174. 

IOWA: Cedar Rapids (319) 395-9550. 

KANSAS: Overland Park (913) 451-4511. 

MARYLAND: Baltimore (301) 944-8600. 

MASSACHUSETTS: Waltham (617) 895-9100. 

MICHIGAN: Farmington Hills (313) 553-1500; 
Grand Rapids (616) 957-4200. 

MINNESOTA: Eden Prairie (612) 828-9300. 

MISSOURI: St Louis (314) 569-7600. 

NEW JERSEY: Iselin (201) 750-1050. 

NEW MEXICO: Albuquerque (505) 345-2555. 

NEW YORK: East Syracuse (315) 463-9291, 
Melville (516) 454-6600; Pittsford (716) 385-6770: 
Poughkeepsie (914) 473-2900. 

NORTH CAROLINA: Charlotte (704) 527-0930; 
Raleigh (919) 876-2725. 

OHIO: Beachwood (216) 464-6100; 
Dayton (513) 258-3877. 

OREGON: Beaverton (503) 643-6758. 

PENNSYLVANIA: Blue Bell (215) 825-9500. 

PUERTO RICO: Hato Rey (809) 753-8700. 

TENNESSEE: Johnson City (615) 461-2192. 

TEXAS: Austin (512) 250-6769; 

Houston (713) 778-6592; Richardson (214) 680-5082; 

San Antonio (512) 496-1779. 

UTAH: Murray (801) 266-8972. 

VIRGINIA: Fairfax (703) 849-1400. 

WASHINGTON: Redmond (206) 881-3080. 

WISCONSIN: Brookfield (414) 782-2899. 

CANADA: Nepean, Ontario (613) 726-1970; 
Richmond Hill, Ontario (416) 884-9181; 
St. Laurent, Quebec (514) 336-1860. 

TI Regional 
Technology Centers 

CALIFORNIA: Irvine (714) 660-8140; 
Santa Clara (408) 748-2220; 
Torrance (213) 217-7019. 

COLORADO: Aurora (303) 368-8000. 

GEORGIA: Norcross (404) 662-7945. 

ILLINOIS Arlington Heights (313) 640-2909. 

MASSACHUSETTS: Waltham (617) 895-9196. 

TEXAS: Richardson (214) 680-5066. 

CANADA: Nepean, Ontario (613) 726-1970. 



TI Distributors 



TI AUTHORIZED DISTRIBUTORS 
Arrow/Kierulff Electronics Group 
Arrow Canada (Canada) 
Future Electronics (Canada) 
GRS Electronics Co., Inc. 
Hall-Mark Electronics 
Marshall Industries 
Newark Electronics 
Schweber Electronics 
Time Electronics 
Wyle Laboratories 
Zeus Components 

-OBSOLETE PRODUCT ONLY- 
Rochester Electronics, Inc. 
Newburyport, Massachusetts 
(617) 462-9332 



ALABAMA: Arrow/Kierulff (205) 837,6955; 
Hall-Mark (205) 837-8700; Marshall (205) 881-9235; 
Schweber (205) 895-0480. 

ARIZONA: Arrow/Kierulff (602) 437-0750, 
Hall-Mark (602) 437-1200; Marshall (602) 496-0290; 
Schweber (602) 997-4874; Wyle (602) 866-2888. 

CALIFORNIA: Los Angeles/Orange County: 

Arrow/Kierulff (818) 701-7500. (714) 838-5422; 

Hall-Mark (818) 716-7300, (714) 669-4100, 

(213) 217-8400; Marshall (818) 407-0101, (818) 459-5500, 

(714) 458-5395; Schweber (818) 999-4702; 

(714) 863-0200, (213) 320-8090; Wyle (213) 322-9953, 

(818) 880-9000, (714) 863-9953; Zeus (714) 921-9000; 

Sacramento: Hall-Mark (916) 722-8600; 

Marshall (916) 635-9700; Schweber (916) 929-9732; 

Wyle (916) 638-5282; 

San Diego: Arrow/Kierulff (619) 565-4800; 

Hail-Mark (619) 268-1201; Marshall (619) 578-9600; 

Schweber (619) 450-0454; Wyle (619) 565-9171; 

San Francisco Bay Area: Arrow/Kierulff (408) 745-6600, 

Hall-Mark (408) 432-0900; Marshall (408) 942-4600; 

Schweber (408) 432-7171 ; Wyle (408) 727-2500; 

Zeus (408) 998-5121. 

COLORADO: Arrow/Kierulff (303) 790-4444; 
Hall-Mark (303) 790-1662; Marshall (303) 451-8383; 
Schweber (303) 799-0258; Wyle (303) 457-9953. 

CONNETICUT: Arrow/Kierulff (203) 265-7741; 
Hall-Mark (203) 269-0100; Marshall (203) 265-3822; 
Schweber (203) 748-7080. 

FLORIDA: Ft. Lauderdale: 

Arrow/Kierulff (305) 429-8200; Hall-Mark (305) 971-9280; 
Marshall (305) 977-4880; Schweber (305) 977-751 1 ; 
Orlando: Arrow/Kierulff (305) 725-1480, (305) 682-6923; 
Hall-Mark (305) 855-4020; Marshall (305) 767-8585; 
Schweber (305) 331-7555; Zeus (305) 365-3000; 
Tampa: Hall-Mark (813) 530-4543; 
Marshall (813) 576-1399. 

GEORGIA: Arrow/Kierulff (404) 449-8252; 
Hall-Mark (404) 447-8000; Marshall (404) 923-5750; 
Schweber (404) 449-9170. 

ILLINOIS: Arrow/Kierulff (312) 250-0500; 
Hall-Mark (312) 860-3800; Marshall (312) 490-0155; 
Newark (312) 784-5100; Schweber (312) 364-3750. 

INDIANA: Indianapolis: Arrow/Kierulff (3 17) 243-9353; 
Hall-Mark (317) 872-8875; Marshall (317) 297-0483. 

IOWA: Arrow/Kierulff (319) 395-7230: 
Schweber (319) 373-1417. 

KANSAS: Kansas City: Arrow/Kierulff (913) 541-9542; 
Hall-Mark (913) 888^4747; Marshall (913) 492-3121; 
Schweber (913) 492-2922. 

MARYLAND: Arrow/Kierulff (301) 995-6002; 
Hall-Mark (301) 988-9800; Marshall (301) 840-9450; 
Schweber (301) 840-5900; Zeus (301) 997-1118. 



Texas 
Instruments 



MASSACHUSETTS Arrow/Kierulff (617) 935-5134; 
Hall-Mark (617) 667-0902; Marshall (617) 658-0810; 
Schweber (617) 275-5100, (617) 657-0760; 
Time (617) 532-6200; Zeus (617) 863-8800. 

MICHIGAN: Detroit: Arrow/Kierulff (313) 971-8220; 
Marshall (313) 525-5850; Newark (313) 967-0600; 
Schweber (313) 525-8100; 
Grand Rapids: Arrow/Kierulff (616) 243-0912. 

MINNESOTA: Arrow/Kierulff (612) 830-1800; 
Hall-Mark (612) 941-2600; Marshall (612) 559-2211; 
Schweber (612) 941-5280. 

MISSOURI: St. Louis: Arrow/Kierulff (314) 567-6888; 
Hall-Mark (314) 291-5350; Marshall (314) 291-4650; 
Schweber (314) 739-0526. 

NEW HAMPSHIRE: Arrow/Kierulff (603) 668-6968; 
Schweber (603) 625-2250. 

NEW JERSEY: Arrow/Kierulff (201) 538-0900, 
(609) 596-8000; GRS Electronics (609) 964-8560; 
Hall-Mark (201) 575-4415, (609) 235-1900; 
Marshall (201) 882-0320, (609) 234-9100; 
Schweber (201) 227-7880. 

NEW MEXICO: Arrow/Kierulff (505) 243-4566. 

NEW YORK: Long Island: 

Arrow/Kierulff (516) 231-1000; Hall-Mark (516) 737-0600; 

Marshall (516) 273-2424; Schweber (516) 334-7555; 

Zeus (914) 937-7400; 

Rochester: Arrow/Kierulff (716) 427-0300; 

Hall-Mark (716) 244-9290; Marshall (716) 235-7620; 

Schweber (716) 424-2222; 

Syracuse: Marshall (607) 798-161 1. 

NORTH CAROLINA: Arrow/Kierulff (919) 876-3132, 
(919) 725-8711; Hall-Mark (919) 872-0712; 
Marshall (919) 878-9882; Schweber (919) 876-0000. 

OHIO: Cleveland: Arrow/Kierulff (216) 248-3990; 

Hall-Mark (216) 349-4632; Marshall (216) 248-1788; 

Schweber (216) 464-2970; 

Columbus: Arrow/Kierulff (614) 436-0928; 

Hall-Mark (614) 888-3313; 

Dayton: Arrow/Kierulff (513) 435-5563; 

Marshall (513) 898-4480; Schweber (513) 439-1800. 

OKLAHOMA: Arrow/Kierulff (918) 252-7537; 
Schweber (918) 622-8003. 

OREGON: Arrow/Kierulff (503) 645-6456; 
Marshall (503) 644-5050; Wyle (503) 640-6000. 

PENNSYLVANIA: Arrow/Kierulff (412) 856-7000, 
(215) 928-1800; GRS Electronics (215) 922-7037; 
Schweber (215) 441-0600, (412) 963-6804, 

TEXAS: Austin; Arrow/Kierulff (512) 835-4180; 
Hall-Mark (512) 258-8848; Marshall (512) 837-1991; 
Schweber (512) 339-0088; Wyle (512) 834-9957; 
Dallas: Arrow/Kierulff (214) 380-6464; 
Hall-Mark (214) 553-4300; Marshall (214) 233-5200; 
Schweber (214) 661-5010; Wyle (214) 235-9953; 
Zeus (214) 783-7010; 
Houston: Arrow/Kierulff (713) 530-4700; 
Hall-Mark (713) 781-6100; Marshall (713) 895-9200; 
Schweber (713) 784-3600; Wyle (713) 879-9953. 

UTAH: Arrow/Kierulff (801) 973-6913; 

Hall-Mark (801) 972-1008; Marshall (801) 485-1551; 

Wyle (801) 974-9953. 

WASHINGTON: Arrow/Kierulff (206) 575-4420; 
Marshall (206) 747-9100; Wyle (206) 453-8300. 

WISCONSIN: Arrow/Kierulff (414) 792-0150; 
Hall-Mark (414) 797-7844; Marshall (414) 797-8400; 
Schweber (414) 784-9020. 

CANADA: Calgary: Future (403) 235-5325; 

Edmonton: Future (403) 438-2858; 

Montreal: Arrow Canada (514) 735-551 1 ; 

Future (514) 694-7710; 

Ottawa: Arrow Canada (613) 226-6903; 

Future (613) 820-8313; 

Quebec City: Arrow Canada (418) 687-4231; 

Toronto: Arrow Canada (416) 672-7769; 

Future (416) 638-4771; 

Vancouver: Future (604) 294-1166; 

Winnipeg: Future (204) 339-0554. 

Customer 
Response Center 

TOLL FREE: (800) 232-3200 

OUTSIDE USA: (214) 995-6611 

(8.00 a.m. - 5:00 p.m. CST) 



TI Worldwide 
Sales Offices 



ALABAMA: Huntsville: 500 Wynn Drive, Suite 514. 
Huntsville, AL 35805, (205) 837-7530. 

ARIZONA: Phoenix: 8825 N. 23rd Ave.. Phoenix. 
AZ 85021, (602) 995-1007. 

CALIFORNIA: Irvine: 17891 Cartwright Rd., Irvine, 
CA 92714, (714) 660-8187; Sacramento: 1900 Point 
West Way, Suite 171, Sacramento, CA 95815, 
(916) 929-1521; San Diego: 4333 View Ridge Ave.. 
Suite B., San Diego, CA 92123, (619) 278-9601; 
Santa Clara: 5353 Betsy Ross Dr., Santa Clara. CA 
95054, (408) 980-9000; Torrance: 690 Knox St., 
Torrance, CA 90502, (213) 217-7010; 
Woodland Hills: 21220 Erwin St., Woodland Hills, 
CA 91367, (818) 704-7759. 

COLORADO: Aurora: 1400 S. Potomac Ave., 
Suite 101, Aurora, CO 80012, (303) 368-8000. 

CONNECTICUT: Wallingford: 9 Barnes Industrial 
Park Rd., Barnes Industrial Park, Wallingford, 
CT 06492, (203) 269-0074. 

FLORIDA: Ft. Lauderdale: 2765 N.W. 62nd St., 
Ft. Lauderdale, FL 33309, (305) 973-8502; 
Maitland: 2601 Maitland Center Parkway, 
Maitland, FL 32751, (305) 660-4600; 
Tampa: 5010 W. Kennedy Blvd., Suite 101 
Tampa, FL 33609, (813) 870-6420. 

GEORGIA: Norcross: 5515 Spalding Drive, Norcross, 
GA 30092, (404) 662-7900 

ILLINOIS: Arlington Heights: 515 W. Algonquin, 
Arlington Heights, IL 60005, (312) 640-2925. 

INDIANA: Ft. Wayne: 2020 Inwood Dr., Ft. Wayne. 
,IN 46815, (219) 424-5174; 
Indianapolis: 2346 S. Lynhurst, Suite J-400. 
Indianapolis, IN 46241, (317) 248-8555. 

IOWA: Cedar Rapids: 373 Collins Rd. NE, Suite 200, 
Cedar Rapids, IA 52402, (319) 395-9550. 

MARYLAND: Baltimore: 1 Rutherford PI., 
7133 Rutherford Rd., Baltimore, MD 21207, 
(301) 944-8600. 

MASSACHUSETTS: Waltham: 504 Totten Pond Rd., 
Waltham, MA 02154, (617) 895-9100. 

MICHIGAN: Farmington Hills: 33737 W. 12 Mile Rd., 
Farmington Hills, Ml 48018, (313) 553-1500. 

MINNESOTA: Eden Prairie: 11000 W. 78th St., 
Eden Prairie, MN 55344 (612) 828-9300. 

MISSOURI: Kansas City: 8080 Ward Pkwy.. Kansas 

City, MO 64114, (816) 523-2500; 

St. Louis: 11816 Borman Drive, St, Louis, 

MO 63146, (314) 569-7600. 

NEW JERSEY: Iselin: 485E U.S. Route 1 South, 
Parkway Towers, Iselin, NJ 08830 (201) 750-1050 

NEW MEXICO: Albuquerque: 2820-D Broadbent Pkwy 
NE, Albuquerque, NM 87107, (505) 345-2555. 

NEW YORK: East Syracuse: 6365 Collamer Dr., East 
Syracuse, NY 13057, (315) 463-9291; 
Endicott: 112 Nanticoke Ave., P.O. Box 618, Endicott 
NY 13760, (607) 754-3900; Melville: 1 Huntington 
Quadrangle, Suite 3C10, P.O. Box 2936, Melville, 
NY 11747, (516) 454-6600; Pittsford: 2851 Clover St., 
Pittsford, NY 14534, (716) 385-6770; 
Poughkeepsie: 385 South Rd., Poughkeepsie, 
NY 12601, (914) 473-2900. 

NORTH CAROLINA: Charlotte: 8 Woodlawn Green, 
Woodlawn Rd., Charlotte, NC 28210, (704) 527-0930; 
Raleigh: 2809 Highwoods Blvd., Suite 100, Raleiqh. 
NC 27625, (919) 876-2725. 

OHIO: Beachwood: 23408 Commerce Park Rd., 
Beachwood, OH 44122, (216) 464-6100. 
Dayton: Kingsley Bldg., 4124 Linden Ave., Dayton. 
OH 45432, (513) 258-3877. 

OREGON: Beaverton: 6700 SW 105th St.. Suite 110. 
Beaverton, OR 97005, (503) 643-6758 



PENNSYLVANIA: Ft. Washington: 260 New York Dr . 
Ft. Washington. PA 19034, (215) 643-6450; 
Coraopolis: 420 Rouser Rd., 3 Airport Office Park. 
Coraopolis. PA 15108. (412) 771-8550. 

PUERTO RICO: Hato Rey: Mercantil Plaza Bldg.. 
Suite 505. Hato Rey. PR 00919, (809) 753-8700 

TEXAS: Austin: P.O. Box 2909, Austin, TX 78769. 
(512) 250-7655; Richardson: 1001 E. Campbell Rd.. 
Richardson, TX 75080, 

(214) 680-5082; Houston: 9100 Southwest Frwy.. 
Suite 237, Houston, TX 77036. (713) 778-6592. 
San Antonio: 1000 Central Parkway South. 
San Antonio. TX 78232, (512) 496-1779. 

UTAH: Murray: 5201 South Green SE. Suite 200. 
Murray, UT 84107. (801) 266-8972. 

VIRGINIA: Fairfax: 2750 Prosperity. Fairfax. VA 
22031, (703) 849-1400. 

WASHINGTON: Redmond: 5010 148th NE, Bldg B. 
Suite 107, Redmond, WA 98052, (206) 881-3080. 

WISCONSIN: Brookfield: 450 N. Sunny Slope. 
Suite 150, Brookfield, Wl 53005, (4.14) 785-7140. 

CANADA: Nepean: 301 Moodie Drive, Mallorn 

Center, Nepean, Ontario, Canada, K2H9C4, 

(613) 726-1970. Richmond Hill: 280 Centre St. E.. 

Richmond Hill L4C1B1, Ontario, Canada 

(416) 884-9181; St. Laurent: Ville St. Laurent Quebec, 

9460 Trans Canada Hwy., St. Laurent, Quebec, 

Canada H4S1R7, (514) 335-8392. 



ARGENTINA: Texas Instruments Argentina 

S.A.I. C.F.: Esmeralda 130, 15th Floor, 1035 Buenos 

Aires, Argentina, 1 +394-3008. 

AUSTRALIA (& NEW ZEALAND): Texas Instruments 
Australia Ltd.. 6-10 Talavera Rd., North Ryde 
(Sydney), New South Wales, Australia 2113, 
2 + 887-1122; 5th Floor, 418 St. Kilda Road, 
Melbourne, Victoria, Australia 3004, 3 + 267-4677; 
171 Philip Highway, Elizabeth, South Australia 5112, 
8 + 255-2066. 

AUSTRIA: Texas Instruments Ges.m.b.H.: 
Industriestrabe B/16. A-2345 Brunn/Gebirge, 
2236-846210. 

BELGIUM: Texas Instruments N.V. Belgium S.A.: 
Mercure Centre, Raketstraat 100, Rue de la Fusee, 
1130 Brussels, Belgium, 2/720.80.00. 

BRAZIL: Texas Instruments Electronicos do Brasil 
Ltda.: Rua Paes Leme, 524-7 Andar Pinheiros, 05424 
Sao Paulo, Brazil, 0815-6166. 

DENMARK: Texas Instruments A/S, Mairelundvej 
46E, DK-2730 Herlev, Denmark, 2 - 91 74 00. 

FINLAND: Texas Instruments Finland OY: 
Teollisuuskatu 19D 00511 Helsinki 51, Finland, (90) 
701-3133. 

FRANCE: Texas Instruments France: Headquarters 
and Prod. Plant, BP 05, 06270 Villeneuve-Loubet, 
(93) 20-01-01; Paris Office, BP 67 8-10 Avenue 
Morane-Saulnier, 78141 Velizy-Villacoublay, 
(3) 946-97-12; Lyon Sales Office, L'Oree D'Ecully, 
Batiment B, Chemin de la Forestiere, 69130 Ecully, 
(7) 833-04-40; Strasbourg Sales Office, Le Sebastopol 
3, Quai Kleber, 67055 Strasbourg Cedex, 
(88) 22-12-66; Rennes, 23-25 Rue du Puits Mauger, 
35100 Rennes, (99) 31-54-86; Toulouse Sales Office, 
Le Peripole— 2, Chemin du Pigeonnier de la Cepiere, 
31100 Toulouse, (61) 44-18-19; Marseille Sales Office, 
Noilly Paradis— 146 Rue Paradis, 13006 Marseille, 
(91) 37-25-30. 



Texas 
Instruments 



GERMANY (Fed. Republic of Germany): Texas 
Instruments Deutschland GmbH: Haggertystrasse 1, 
D-8050 Freising, 8161 +80-4591; Kurfuerstendamm 
195/196, D-1000 Berlin 15, 30 + 882-7365; III, Hagen 
43/Kibbelstrasse. .19, D-4300 Essen, 201-24250; 
Frankfurter Allee 6-8, D-6236 Eschborm 1, 
06196 + 8070; Hamburgerstrasse 11, D-2000 Hamburg 
76, 040 + 220-1 154, Kirchhorsterstrasse 2, D-3000 
Hannover 51, 511 +648021; Maybachstrabe 11. 
D-7302 Ostfildern 2-Nelingen, 711 +547001; 
Mixikoring 19, D-2000 Hamburg 60, 40+637+0061; 
Postfach 1309, Roonstrasse 16, D-5400 Koblenz, 
261 +35044. 

HONG KONG (+ PEOPLES REPUBLIC OF CHINA): 

Texas Instruments Asia Ltd., 8th Floor, World 
Shipping Ctr., Harbour City, 7 Canton Rd., Kowloon, 
Hong Kong. 3 + 722-1223. 

IRELAND: Texas Instruments (Ireland) Limited; 
Brewery Rd., Stillorgan, County Dublin, Eire, 

1 831311. 

ITALY: Texas Instruments Semiconduttori Italia Spa: 
Viale Delle Scienze, 1, 02015 Cittaducale (Rieti), 
Italy, 746 694.1; Via Salaria KM 24 (Palazzo Cosma), 
Monterotondo Scalo (Rome), Italy, 6 + 9003241; Viale 
Europa, 38-44, 20093 Cologho Monzese (Milano), 

2 2532541; Corso Svizzera, 185, 10100 Torino, Italy, 
11 774545; Via J. Barozzi 6, 40100 Bologna, Italy, 51 
355851. 

JAPAN: Texas Instruments Asia Ltd.; 4F Aoyama 
Fuji Bldg., 6-12, Kita Aoyama 3-Chome, Minato-ku, 
Tokyo, Japan 107, 3-498-2111; Osaka Branch, 5F, 
Nissho Iwai Bldg., 30 Imabashi 3- Chome, 
Higashi ku, Osaka, Japan 541, 06-204-1881; Nagoya 
Branch, 7F Daini Toyota West Bldg., 10-27, Meieki 
4-Chome, Nakamura-ku Nagoya, Japan 
450, 52-583-8691. 

KOREA: Texas Instruments Supply Co.: 3rd Floor, 
Samon Bldg., Yuksam-Dong, Gangnam-ku, 
135 Seoul, Korea, 2 + 462-8001. 

MEXICO: Texas Instruments de Mexico S.A.: Mexico 
City, AV Reforma No. 450 — 10th Floor, Mexico, 
D.F., 06600, 5+514-3003. 

MIDDLE EAST: Texas Instruments: No. 13, 1st Floor 
Mannai Bldg., Diplomatic Area, P.O. Box 26335, 
Manama Bahrain, Arabian Gulf, 973 + 274681. 

NETHERLANDS: Texas Instruments Holland B.V., 
P.O. Box 12995, (Bullewijk) 1100 CB Amsterdam, 
Zuid-Oost, Holland 20 + 560291 1 . 

NORWAY: Texas Instruments Norway A/S: PB106, 
Refstad 131, Oslo 1, Norway, (2) 155090. 

PHILIPPINES: Texas Instruments Asia Ltd.: 14th 
Floor, Ba- Lepanto Bldg., 8747 Paseo de Roxas, 
Makati, Metro Manila, Philippines, 2 + 8188987. 

PORTUGAL: Texas Instruments Equipamento 
Electronico (Portugal), Lda.: Rua Eng. Frederico 
Ulrich, 2650 Moreira Da Maia, 4470 Maia, Portugal, 
2-948-1003. 

SINGAPORE (+ INDIA, INDONESIA, MALAYSIA, 
THAILAND): Texas Instruments Asia Ltd.: 12 Lorong 
Bakar Batu, Unit 01-02, Kolam Ayer Industrial Estate, 
Republic of Singapore, 747-2255. 

SPAIN: Texas Instruments Espana, S.A.; C/Jose 
Lazaro Galdiano No. 6, Madrid 16, 1/458.14.58. 

SWEDEN: Texas Instruments International Trade 
Corporation (Sverigefilialen): Box 39103, 10054 
Stockholm, Sweden, 8 - 235480. 

SWITZERLAND: Texas Instruments, Inc., Reidstrasse 
6, CH-8953 Dietikon (Zuerich) Switzerland, 
1-740 2220. 

TAIWAN: Texas Instruments Supply Co.: Room 903, 
205 Tun Hwan Rd:, 71 Sung-Kiang Road, Taipei. 
Taiwan, Republic of China, 2 + 521-9321. 

UNITED KINGDOM: Texas Instruments Limited: 
Manton Lane, Bedford, MK41 7PA, England, 0234 
67466; St. James House, Wellington Road North, 
Stockport, SK4 2RT, England, 61 +442-7162. 



Erratum to SN74ACT8837 User's Guide 



Format conversion from floating point to integer for numbers 
between 1.111 ... 111 x 231 and 232, when rounding up, 
causes the resulting integer to go to zero without causing the 
device to signal an overflow. This condition occurs whenever 
rounding causes the integer operand to be incremented. If the 
user anticipates that this condition may occur in a system being 
designed, a software trap can be implemented to monitor correct 
operation of the format conversion. 
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